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(57) Abstract 

The present invention relates to a fatty acid 
A5-desaturase able to catalyze the conversion of 
dihomo-gamma-linolenic acid to arachidonic acid. 
Nucleic acid sequences encoding A5-desaturase, nucleic 
acid sequences which hybridize thereto, DNA constructs 
comprising a A5~desaturase gene, and recombinant 
host microorganism or animal expressing increased 
levels of a A5-desaturase are described. Methods for 
desaturating a fatty acid at the A 5 position and for 
producing arachidonic acid by expressing increased 
levels of a A5-desaturase are disclosed. Fatty acids, 
and oils containing them, which have been desaturated 
by a A5-desaturase produced by recombinant host 
microorganisms or animals are provided. Pharmaceutical 
compositions, infant formulas or dietary supplements 
containing fatty acids which have been desaturated 
by a A5-desaturase produced by a recombinant host 
microorganism or animal also are described. 
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METHODS AND COMPOSITIONS FOR SYNTHESIS OF 
LONG CHAIN POLYUNSATURATED FATTY ACIDS 

RELATED APPLICATION 

S This application is a continuation in part application of Serial Number 

08/833,610 filed April 11, 1997. 

INTRODUCTION 

Field of the Invention 

This invention relates to modulating levels of enzymes and/or enzyme 
10 components relating to production of long chain poly-unsaturated fatty acids 
(PUFAs) in a microorganism or animal. 



Background 

Two main families of polyunsaturated fatty acids (PUFAs) are the co3 
15 fatty acids, exemplified by eicosapentaenoic acid (EPA), and the co6 fatty acids, 
exemplified by arachidonic acid (ARA). PUFAs are important components of 
the plasma membrane of the cell, where they may be found in such forms as 
phospholipids. PUFAs are necessary for proper development, particularly in the 
developing infant brain, and for tissue formation and repair. PUFAs also serve 
20 as precursors to other molecules of importance in human beings and animals, 
including the prostacyclins, eicosanoids, leukotrienes and prostaglandins. 

Four major long chain PUFAs of importance include docosahexaenoic 
acid (DHA) and EPA, which are primarily found in different types of fish oil, 
gamma-linolenic acid (GLA), which is found in the seeds of a number of plants, 
25 including evening primrose {Oenothera biennis), borage {Borago officinalis) 
and black currants (Ribes nigrum), and stearidonic acid (SDA), which is found 
in marine oils and plant seeds. Both GLA and another important long chain 
PUFA, arachidonic acid (ARA), are found in filamentous fungi. ARA can be 
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purified from animal tissues including liver and adrenal gland. GLA, ARA, 
EPA and SDA are themselves, or are dietary precursors to, important long chain 
fatty acids involved in prostaglandin synthesis, in treatment of heart disease, 
and in development of brain tissue. 

S Polyunsaturated fatty acids have a number of pharmaceutical and 

medical applications including treatment of heart disease, cancer and arthritis. 

For DHA, a number of sources exist for commercial production 
including a variety of marine organisms, oils obtained from cold water marine 
fish, and egg yolk fractions. For ARA, microorganisms including the genera 

1 0 Mortierella, Entomophthora, Phytium and Porphyridium can be used for 
commercial production. Commercial sources of SDA include the genera 
Trichodesma andEchium. Commercial sources of GLA include evening 
primrose, black currants and borage. However, there are several disadvantages 
associated with commercial production of PUF As from natural sources. Natural 

1 5 sources of PUF As, such as animals and plants, tend to have highly 

heterogeneous oil compositions. The oils obtained from these sources therefore 
can require extensive purification to separate out one or more desired PUF As or 
to produce an oil which is enriched in one or more PUF A. Natural sources also 
are subject to uncontrollable fluctuations in availability. Fish stocks may 

20 undergo natural variation or may be depleted by overfishing. Fish oils have 

impleasant tastes and odors, which may be impossible to economically separate 
from the desired product, and can render such products unacceptable as food 
supplements. Animal oils, and particularly fish oils, can accumulate 
environmental pollutants. Weather and disease can cause fluctuation in yields 

25 from both fish and plant sources. Cropland available for production of alternate 
oil-producing crops is subject to competition from the steady expansion of 
human populations and the associated increased need for food production on the 
remaining arable land. Crops which do produce PUFAs, such as borage, have 
not been adapted to commercial grov^h and may not perform well in 

30 monoculture. Growth of such crops is thus not economically competitive where 
more profitable and better established crops can be grown. Large scale 
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fermenlation of organisms such as Mortierella is also expensive. Natural 
animal tissues contain low amounts of ARA and are difficult to process. 
Microorganisms such as Porphyridium and Mortierella are difficult to cultivate 
on a commercial scale. 

S Dietary supplements and pharmaceutical formulations containing 

PUFAs can retain the disadvantages of the PUFA source. Supplements such as 
fish oil capsules can contain low levels of the particular desired component and 
thus require large dosages. High dosages result in ingestion of high levels of 
undesired components, including contaminants. Unpleasant tastes and odors of 

1 0 the supplements can make such regimens undesirable, and may inhibit 
compliance by the patient. Care must be taken in providing fatty acid 
supplements, as overaddition may result in suppression of endogenous 
biosynthetic pathways and lead to competition with other necessary fatty acids 
in various lipid fractions in vivo^ leading to undesirable results. For example, 

1 S Eskimos having a diet high in ct>3 fatty acids have an increased tendency to 
bleed (U.S. Pat. No. 4,874,603). 

A number of enzymes are involved in PUFA biosynthesis. Linolenic 
acid (LA, 18:2 A9, 12) is produced from oleic acid (18:1 A"") by a A12- 
desaturase. GLA (18:3 A6, 9, 12) is produced from linoleic acid (LA, 18:2 A9, 

20 12) by a A6-desaturase. ARA (20:4 A5, 8, 11, 14) production from dihomo- 
gamma-linolenic acid (DGLA, 20:3 A8, 1 1, 14) is catalyzed by a A5-desaturase. 
However, animals cannot desaturate beyond the A9 position and* therefore 
cannot convert oleic acid (18:1 A9) into linolenic acid (18:2 A912). Likewise, 
a-linoleic acid (ALA, 18:3 A9, 12, 15) cannot be synthesized by mammals. 

25 Other eukaryotes, including fungi and plants, have enzymes which desaturate at 
positions A12 and A15. The major poly-unsaturated fatty acids of animals 
therefore are either derived fi'om diet and/or from desaturation and elongation of 
linoleic acid (18:2 A9, 12)or oc-linolenic acid (18:3 A9, 12, 15). Therefore it is 
of interest to obtain genetic material involved in PUFA biosynthesis from 

30 species that naturally produce these fatty acids and to express the isolated 
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material in a microbial or animal system which can be manipulated to provide 
production of commercial quantities of one or more PUFAs. Thus there is a 
need for fatty acid desaturases, genes encoding them, and recombinant methods 
of producing them. A need further exists for oils containing higher relative 
5 proportions of and/or enriched in specific PUFAs. A need also exists for 
reliable economical methods of producing specific PUFAs. 

Relevant Literature 

Production of gamma-linolenic acid by a A6-desaturase is described in 
USPN 5,552,306. Production of 8, 1 1 -eicosadienoic acid using Mortierella 

1 0 alpina is disclosed in USPN 5,376,54 1 . Production of docosahexaenoic acid by 
dinoflagellates is described in USPN 5,407,957. Cloning of a A6-palmitoyl- 
acyl carrier protein desaturase is described in PCT publication WO 96/13591 
and USPN 5,614,400. Cloning of a A6-desaturase from borage is described in 
PCT publication WO 96/21022. Cloning of A9-desaturases is described in the 

1 5 published patent applications PCT WO 91/13972, EP 0 550 162 Al, EP 0 561 
569 A2, EP 0 644 263 A2, and EP 0 736 598 Al, and in USPN 5,057,419. 
Cloning of A12-desaturase5 from various organisms is described in PCT 
publication WO 94/1 1516 and USPN 5,443,974. Cloning of A15-desaturases 
from various organisms is described in PCT publication WO 93/1 1245. All 

20 publications and U.S. patents or applications referred to herein are hereby 
incorporated in their entirety by reference. 

Summary of the Invention 

Novel compositions and methods are provided for preparation of poly- 
unsaturated long chain fatty acids or PUFAs. The compositions include nucleic 
25 acids encoding a A5-desaturase and/or polypeptides having A5-desaturase 

activity, the polypeptides, and probes for isolating and detecting the same. The 
methods involve growing a host microorganism or animal which contains and 
expresses one or more transgenes encoding a A5-desaturase and/or a 
polypeptide having A5-desaturase activity. Expression of the desaturase 
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polypeptide provides for a relative increase in AS-desatvirated PUFA, or 
metabolic progeny therefrom, as a result of altered concentrations of enzymes 
and substrates involved in PUFA biosynthesis. The invention finds use for 
example in the large scale production of PUFA containing oils v^hich include, 
5 for example, ARA, EPA and/or DHA. 

In a preferred embodiment, a nucleic acid sequence comprising a A5- 
desaturase depicted in Figure 3A-D (SEQ ID NO 1), a polypeptide encoded by 
the nucleic acid, and a purified or isolated polypeptide depicted in Figure 3 A-D 
(SEQ ID NO: 2), and an isolated nucleic acid encoding the polypeptide of 

1 0 Figure 3 A-D (SEQ ID NO: 2) are provided. Another embodiment of the 
invention is an isolated nucleic acid sequence which encodes a polypeptide, 
wherein said polypeptide desaturates a fatty acid molecule at carbon S from the 
carboxyl end of the molecule. The nucleic acid is preferably derived from a 
eukaryotic cell, such as a fiingal cell, or a fungal cell of the genus Moriierellay 

15 or of the genus/species Morderella alpina. Also preferred is an isolated nucleic 
acid comprising a sequence which anneals to a nucleotide sequence depicted in 
Figure 3A-3D (SEQ ID NO: 1), and a nucleic acid which encodes an amino acid 
sequence depicted in Figure 3A-D (SEQ ID NO: 2). In particular, the nucleic 
acid encodes an amino acid sequence depicted in Figure 3A-D (SEQ ID NO: 2) 

20 which is selected from the group consisting of amino acid residues 30-38, 41- 
44, 171-175, 203-212, and 387-394. In an additional embodiment, the invention 
provides an isolated or purified polypeptide which desaturates a fatty acid 
molecule at carbon 5 from the carboxyl end of the molecule. Also provided is 
an isolated nucleic acid sequence which hybridizes to a nucleotide sequence 

25 depicted in Figure 3 A-D (SEQ ID NO 1), an isolated nucleic acid sequence 
having at least about 50% identity to Figure 3 A-D (SEQ ID NO 1). 

The present invention further includes a nucleic acid construct 
comprising a nucleotide sequence depicted in a Figure 3 A-D (SEQ ID NO: 1) 
linked to a heterologous nucleic acid; a nucleic acid construct comprising a 
30 nucleotide sequence depicted in a Figure 3 A-D (SEQ ID NO: 1) operably linked 
to a promoter; and a nucleic acid construct comprising a nucleotide sequence 

-5- 
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depicted in a Figure 3A-D (SEQ ID NO: 1) operably linked to a promoter which 
is functional in a microbial cell. In a preferred embodiment, the microbial cell 
is a yeast cell, and the nucleotide sequence is derived from a fungus, such as a 
fungus of the genus Mortierella^ particularly a fungus of the species Mortierella 
5 alpina. 

In another embodiment of the invention, a nucleic acid construct is 
provided which comprises a nucleotide sequence which encodes a polypeptide 
comprising an amino acid sequence which corresponds to or is complementary 
to an amino acid sequence depicted in Figure 3A-D (SEQ ID NO: 2), wherein 

1 0 the nucleotide sequence is operably linked to a promoter which is functional in 
a host cell, and wherein the nucleotide sequence encodes a polypeptide which 
desaturates a fatty acid molecule at carbon 5 from the carboxyl end of a fatty 
acid molecule. Additionally, provided by the invention is a nucleic acid 
construct comprising a nucleotide sequence which encodes a functionally 

1 5 active AS-desaturase, where the desaturase includes an amino acid sequence 
which corresponds to or is complementary to all of or a portion of an amino 
acid sequence depicted in a Figure 3 A-D (SEQ ID NO: 2), wherein the 
nucleotide sequence is operably linked to a promoter functional in a host cell. 

The invention also includes a host cell comprising a nucleic acid 
20 construct of the invention. In a preferred embodiment, a recombinant host cell 
is provided which comprises at least one copy of a DNA sequence which 
encodes a functionally active Mortierella alpina fatty acid desaturase having an 
amino acid sequence as depicted in Figure 3A-D (SEQ ID NO: 2), wherein the 
cell or an ancestor of the cell was transformed with a vector comprising said 
25 DNA sequence, and wherein the DNA sequence is operably linked to a 
promoter. The host cell is either eukaryotic or prokaryotic. Preferred 
eukaryotic host cells are those selected from the group consisting of a 
manunalian cell, an insect cell, a fungal cell, and an algae cell. Preferred 
mammalian cells include an avian cell, a fungal cell such as a yeast, and a 
30 marine algae cell. Preferred prokaryotic cells include those selected from the 
group consisting of a bacteria, a cyanobacteria, cells which contain a 
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bacteriophage, and/or a virus. The DNA sequence of the recombinant host cell 
preferably contains a promoter which is functional in the host cell. 

The host cells of the invention v^hich contain the DNA sequences of the 
invention are enriched for fatty acids, such as 20:3 fatty acids. In a preferred 

5 embodiment, the host cells are enriched for 20:4 fatty acids as compared to an 
untransformed host cell which is devoid of said DNA sequence, and/or enriched 
for 20:5 fatty acids compared to an untransformed host cell which is devoid of 
said DNA sequence. In yet another preferred embodiment, the invention 
provides a recombinant host cell which comprises a fatty acid selected from the 

1 0 group consisting of a dihomo-y-linolenic acid, n-6 eicosatrienoic acid, 20:3n-6 
acid and 20:3 (8,11,14) acid. 

The present invention also includes method for production of 
arachidonic acid in a microbial cell culture, where the method comprises 
growing a microbial cell culture having a pltirality of microbial cells which 

1 5 contain one or more nucleic acids encoding a polypeptide which converts 
dihomo-y-linolenic acid to arachidonic acid, wherein the nucleic acid is 
operably linked to a promoter, under conditions whereby said one or more 
nucleic acids are expressed, whereby arachidonic acid is produced in the 
microbial cell culture. In several preferred embodiments of the invention, the 

20 polypeptide is an enzyme which desaturates a fatty acid molecule at carbon 5 
from the carboxyl end of the fatty acid molecule; the nucleic acid is derived 
from a Mortierella sp.; and the substrate for said polypeptide is exogenously 
supplied. The microbial cells used in the methods can be either eukaryotic 
cells or prokaryotic cells. The preferred eukaryotic cells are those selected 

25 from the group consisting of a mammalian cell, an insect cell, a fungal cell, and 
an algae cell. Preferred mammalian cells include an avian cell, a preferred 
fungal cell is a yeast, and the preferred algae cell is a marine algae cell. The 
preferred prokaryotic cells include those selected from the group consisting of a 
bacteria, a cyanobacteria, cells which contain a bacteriophage, and/or a virus. 

30 The nucleic acid sequence encoding the polypeptide of the microbial ceil 
preferably contains a promoter which is functional in the host cell which 
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optionally is an inducible promoter for example by components of the culture 
broth. The preferred microbial cells used in the methods are yeast cells, such 
as Saccharomyces cells. 

In another embodiment of the invention, a recombinant yeast cell is 
5 provided which converts greater than about 5% of 20:3 fatty acid substrate to a 
20:4 fatty acid product. 

Also provided is an oil comprising one or more PUFA. The amount of 
said one or more PUFAs is approximately 0.3-30% arachidonic acid (ARA), 
approximately 0.2-30% dihomo-y-linolenic acid (DGLA), and approximately 

1 0 0.2-30% y-linolenic acid (GLA). A preferred oil of the invention is one in 
which the ratio of ARA:DGLA:GLA is approximately 1 .0:19.0:30 to 
6.0:1 .0:0.2. Another preferred embodiment of the invention is a pharmaceutical 
composition comprising the oils in a pharmaceutically acceptable carrier. 
Further provided is a nutritional composition comprising the oils of the 

1 5 invention. The nutritional compositions of the invention preferably are 
administered to a mammalian host parenterally or internally. A preferred 
composition of the invention for internal consumption is an infant formula. In a 
preferred embodiment, the nutritional compositions of the invention are in a 
liquid form or a solid form. 

20 The present invention also includes a method for desaturating a fatty 

acid, where the method comprises culturing a recombinant microbial cell of the 
invention under conditions suitable for expression of a polypeptide encoded by 
the nucleic acid, wherein the host cell further comprises a fatty acid substrate of 
the polypeptide. In a preferred embodiment, a fatty acid desaturated by the 

25 methods is provided, including an oil comprising the fatty acid. 

The present invention is also directed to purified nucleotide and peptide 
sequences presented in SEQ ID NO: 1-34. The present invention is further 
directed toward methods of using the sequences presented in SEQ ID NO: 1-34 
as probes to identify related sequences, as components of expression systems 
30 and as components of systems useful for producing transgenic oil. 
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The present invention is further directed to methods of obtaining ahered 
long chain poly unsaturated fatty acid biosystems by growing transgenic 
microbes which encode transgene expression products which desaturate a fatty 
acid molecule at carbon 5 from the carboxyl end of the fatty acid molecule. 

5 The present invention is further directed to formulas, dietary 

supplements or dietary supplements in the form of a liquid or a solid containing 
the long chain fatty acids of the invention. These formulas and supplements 
may be administered to a human or an animal. 

The formulas and supplements of the invention may further comprise at 
10 least one macronutrient selected from the group consisting of coconut oil, soy 
oil, canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

The formulas of the present invention may iurther include at least one 
1 5 vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

The present invention is further directed to a method of treating a patient 
20 having a condition caused by insuffient intake or production of polyunsaturated 
fatty acids comprising administering to the patient a dietary substitute of the 
invention in an amount sufficient to effect treatment of the patient. 

The present invention is further directed to cosmetic and pharmaceutical 
compositions of the material of the invention. 

25 The present invention is also directed to an isolated nucleotide sequence 

comprising a nucleuotide sequence selected from the group consisting of: SEQ 
IDNO:13; SEQIDN0:15; SEQIDN0:17; SEQ1DN0:19; SEQIDNO:21; 
SEQ ID NO:22; SEQ ID NO:22; SEQ ID NO:24; SEQ ID NO:25; SEQ ID 
NO:26 and SEQ ID NO:27. 



9 
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The present invention is also directed to an isolated peptide sequence 
comprising a peptide sequence selected from the group consisting of: SEQ ID 
NO:14; SEQIDN0:16; SEQIDNO:18; SEQIDNO:20; SEQIDNO:28; 
SEQIDNO:29; SEQIDNO:30; SEQIDN0:31; SEQ ID NO:32; SEQ ID 
5 NO:33 and SEQ ID NO:34. 

The present invention is further directed to transgenic oils in 
pharmaceutically acceptable carriers. The present invention is further directed 
to nutritional supplements, cosmetic agents and infant formulae containing 
transgenic oils. 

1 0 The present invention is fiirther directed to a method for obtaining 

altered long chain polyunsaturated fatty acid biosynthesis comprising the steps 
of: growing a microbe having cells which contain a transgene which encodes a 
transgene expression product which desatiu^tes a fatty acid molecule at carbon 
5 from the carboxyl end of said fatty acid molecule, wherein the trangene is 

1 5 operably associated with an expression control sequence, under conditions 
whereby the transgene is expressed, whereby long chain polyunsaturated fatty 
acid biosynthesis in the cells is altered. 

The present invention is further directed to the use of chain 
polyunsaturated fatty acid selected from the group consisting of ARA, DGLA 
20 and EPA. 

The present invention is further directed toward pharmaceutical 
compositions comprising at least one nutrient selected from the group consisting 
of a vitamin, a mineral, a carbohydrate, a sugar, an amino acid, a free fatty acid, 
a phospholipid, an antioxidant, and a phenolic compound. 
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Brief Description of the Drawings 

Figure 1 shows possible pathways for the synthesis of arachidonic acid 
(20:4 A5, 8, 1 1, 14) and stearidonic acid (18:4 A6, 9, 12, 15) from palmitic acid 
(C|6) from a variety of organisms, including algae, Mortierella and humans. 
5 These PUFAs can serve as precursors to other molecules important for humans 
and other animals, including prostacyclins, leukotrienes, and prostaglandins, 
some of which are shown. 

Figure 2 shows possible pathways for production of PUFAs in addition 
to ARA, including EPA and DHA, for a variety of organisms. 

1 0 Figure 3 A-D shows the DN A sequence of the Mortierella alpina A5- 

desaturase and the deduced amino acid sequence. 

Figure 4 shows the deduced amino acid sequence of the PGR fragment 
(see Example 1 ) 

Figure 5A and 5B show alignments of the protein sequence of the A5- 
1 5 desaturase with A6-desaturases. 

Figure 6 A and 6B show the effect of the timing of substrate addition 
relative to induction on conversion of substrate to product in SC334 containing 
the AS-desaturase gene. 

Figure 7 A and 7B show the effect of inducer concentration on A5- 
20 desaturase expression in SC334. 

Figure 8A and 8B show the effect of induction temperature on AS- 
desaturase activity in SC334. 

Figure 9A and 9B show the effect of host strain on the conversion of 
substrate to product in strains expressing the AS-desaturase gene at ISX. 

2S Figure lOA and lOB show the effect of host strain on the conversion of 

substrate to product in strains expressing the AS-desaturase gene at 30°C. 
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Figure 1 1 shows the effect of a host strain expressing choline transferase 
as well as the A5-desaturase gene on the conversion of substrate to product. 

Figure 12A and 12B show the effect of media composition and 
temperature on the conversion of substrate to product in two host strains 
S expressing the AS-desaturase gene. 

Figure 1 3 shows alignment of the protein sequence of Ma 29 and contig 
253538a. 

Figure 14 shows alignment of the protein sequence of Ma 524 and 
contig 253538a. 

10 

Brief Description of the Sequence Listings 

SEQ ID N0:1 shows a DNA sequence of the Mortierella alpina A5- 
desaturase. 

SEQ ID N0:2 shows an amino acid sequence of Mortierella alpina A5- 
15 _ desaturase. 

SEQ ID NO: 3 shows the deduced amino acid sequence of the M alpina 
PCR fragment (see Example 1). 

SEQ ID NO: 4 - SEQ ID NO: 7 show the deduced amino acid sequences 
of various A6-desaturases, 

20 SEQ ID NO: 8 and SEQ ID NO: 9 show PCR primer sequences for A6- 

desaturases 

SEQ ID NO: 10 shows a primer for reverse transcription of total RNA. 

SEQ ID NO: 1 1 and SEQ ID NO: 12 show amino acid motifs for 
desaturase sequences. 

25 SEQ ID NO: 1 3 and SEQ ID NO: 1 4 show the nucleotide and amino 

acid sequence of a Dictyostelium discoideum desaturase sequence. 
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SEQ ID NO: 15 and SEQ ID NO: 16 show the nucleotide and amino 
acid sequence of a Phaeodactylum tricornutum desaturase sequence. 

SEQ ID NO: 17-20 show the nucleotide and deduced amino acid 
sequence of a Schizochytrium cDNA clone. 

5 SEQ ID NO: 21-27 show nucleotide sequences for human desaturases. 

SEQ ID NO: 28 - SEQ ID NO: 34 show peptide sequences for human 
desaturases. 

Detailed Description of the Invention 

1 0 In order to ensure a complete understanding of the invention, the 

following definitions are provided: 

A5-Desaturase: A5 desaturase is an enzyme which introduces a double 
bond between carbons 5 and 6 from the carboxyl end of a fatty acid molecule. 

A6->Desaturase: A6-desaturase is an enzyme which introduces a double 
15 bond between carbons 6 and 7 from the carboxyl end of a fatty acid molecule. 

A9-Desaturasc: A9-desaturase is an enzyme which introduces a double 
bond between carbons 9 and 10 from the carboxyl end of a fatty acid molecule. 

A12-Desaturase: A 1 2-desaturase is an enzyme which introduces a 
double bond between carbons 12 and 13 from the carboxyl end of a fatty acid 
20 molecule. 

Fatty Acids: Fatty acids are a class of compounds containing a long 
hydrocarbon chain and a terminal carboxylate group. Fatty acids include the 
following: 



Fatty Acid 


12:0 


lauric acid 




16:0 


palmitic acid 




16:1 


palmitoletc acid 
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Fatty Acid 


18:0 


stearic acid 




18:1 


oleic acid 


A9-18:l 


18:2 A5,9 


taxoleic acid 


A5,9-18:2 


18:2 A6,9 


6,9-octadecadienoic acid 


A6,9-18:2 


18:2 


linoleic acid 


A9,12-18:2 (LA) 


18:3 A6,9,12 


gamma- linolenic acid 


A6»9,12-18:3 (GLA) 


18:3 A5,9,12 


pinolenic acid 


A5,9,12-18:3 


18:3 


alpha-linolenic acid 


A9,12,l 5-18:3 (ALA) 


18:4 


stearidonic acid 


A6,9,12.15-18:4 (SDA) 


20:0 


Arachidic acid 




20:1 


Eicoscenic Acid 




22:0 


behehic acid 




22:1 


erucic acid 




22:2 


Docasadienoic acid 




20:4 <o6 


arachidonic acid 


A5.8,ll, 14-20:4 (ARA) 


20:3 ci>6 


u)6-eicosatrienoic 
dihomo-gamma linolenic 


A8,l U 14-20:3 (DGLA) 


20:5 €a3 


Eicosapentanoic 
(Timnodonic acid) 


A5,8,ll, 14.17-20:5 (EPA) 


20:3 o>3 


(o3-eicosatrienoic 


Al 1,16,17-20:3 


20:4(1)3 


b>3-eicosatetraenoic 


A8,H,14,17-20:4 


22:5 (o3 


Docasapentaenoic 


A7,10.13,16,19-22:5 (a)3DPA) 


22:6 G)3 


Docosahexaenoic 
(cervonic acid) 


A4,7,10.13.16,19-22:6 (DHA) 


24:0 


Lignoceric acid 





Taking into account these definitions, the present invention is directed to 
novel DNA sequences, DNA constructs, methods and compositions are 
provided which permit modification of the poly-unsaturated long chain fatty 
5 acid content of, for example, microbial cells or animals. Host cells are 
manipulated to express a sense or antisense transcript of a DNA encoding a 
polypeptide(s) which catalyzes the conversion of DGLA to ARA. The 
substrate(s) for the expressed enzyme may be produced by the host cell or may 
be exogenously supplied. To achieve expression, the transformed DNA is 
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operably associated with transcriptional and translational initiation and 
termination regulatory regions that are functional in the host cell. Constructs 
comprising the gene to be expressed can provide for integration into the genome 
of the host cell or can autonomously replicate in the host cell. For production 

5 of ARA, the expression cassettes generally used include a cassette which 

provides for AS-desaturase activity, particularly in a host cell which produces or 
can take up DGLA. Production of co6-type unsaturated fatty acids, such as 
ARA, is favored in a host microorganism or animal which is substantially free 
of ALA. The host is selected or obtained by removing or inhibiting activity of a 

10 A 15- or 0)3- type desaturase (see Figure 2). The endogenous desaturase activity 
can be affected by providing an expression cassette for an antisense A15 or co3 
transcript, by disrupting a target A15- or co 3 -desaturase gene through insertion, 
substitution and/or deletion of all or part of the target gene, or by adding a Al S- 
or (03-desaturase inhibitor. Production ofLA also can be increased by ' 

1 5 providing expression cassettes for A9 and/or A 1 2-desatiu-ases where their 
respective enzymatic activities are limiting. 



MICROBIAL PRODUCTION OF FATTY ACIDS 

Microbial production of fatty acids has several advantages over 
20 purification from natural sources such as fish or plants. Many microbes are 
knovm with greatly simplified oil compositions compared with those of higher 
organisms, making purification of desired components easier. Microbial 
production is not subject to fluctuations caused by external variables such as 
weather and food supply. Microbially produced oil is substantially free of 
25 contamination by environmental pollutants. Additionally, microbes can provide 
PUFAs in particular forms which may have specific uses. For example, 
Spirulina can provide PUFAs predominantly at the first and third positions of 
triglycerides; digestion by pancreatic lipases preferentially releases fatty acids 
from these positions. Following human or animal ingestion of triglycerides 
30 derived from Spirulina^ these PUFAs are released by pancreatic lipases as fi^ee 
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fatty acids and thus are directly available, for example, for infant brain 
development. Additionally, microbial oil production can be manipulated by 
controlling culture conditions, notably by providing particular substrates for 
microbially expressed enzymes, or by addition of compounds which suppress 

S imdesired biochemical pathways. In addition to these advantages, production of 
fatty acids from recombinant microbes provides the ability to alter the naturally 
occurring microbial fatty acid profile by providing new synthetic pathways in 
the host or by suppressing undesired pathways, thereby increasing levels of 
desired PUFAs, or conjugated fonns thereof, and decreasing levels of undesired 

10 PUFAs. 

PRODUCTION OF FATTY ACIDS IN ANIMALS 

Production of fatty acids in animals also presents several advantages. 
Expression of desaturase genes in animals can produce greatly increased levels 
of desired PUFAs in animal tissues, making recovery from those tissues more 

1 5 economical. For example, where the desired PUFAs are expressed in the breast 
milk of animals, methods of isolating PUFAs from animal milk are well 
established. In additiori to providing a source for purification of desired 
PUFAs, animal breast milk can be manipulated through expression of 
desaturase genes, either alone or in combination with other human genes, to 

20 provide animal milks with a PUFA composition substantially similar to himian 
breast milk during the different stages of infant development. Humanized 
animal milks could serve as infant formulas where human nursing is impossible 
or undesired, or in cases of malnourishment or disease. 

Depending upon the host cell, the availability of substrate, and the 
25 desired end product(s), several polypeptides, particularly desaturases, are of 

interest. By "desaturase" is intended a polypeptide which can desaturate one or 
more fatty acids to produce a mono- or poly-unsaturated fatty acid or precursor 
thereof of interest. Of particular interest are polypeptides which can catalyze 
the conversion of DGLA to produce ARA which includes enzymes which 
30 desaturate at the A5 position. By "polypeptide" is meant any chain of amino 
acids, regardless of length or post-translational modification, for example, 
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glycosylation or phosphorylation. Considerations for choosing a specific 
polypeptide having desaturase activity include the pH optimum of the 
polypeptide, whether the polypeptide is a rate limiting enzyme or a component 
thereof, whether the desaturase used is essential for synthesis of a desired poly- 

5 unsaturated fatty acid, and/or co-factors required by the polypeptide. The 
expressed polypeptide preferably has parameters compatible with the 
biochemical environment of its location in the host cell. For example, the 
polypeptide may have to compete for substrate with other enzymes in the host 
cell. Analyses of the Km and specific activity of the polypeptide in question 

1 0 therefore are considered in determining the suitability of a given polypeptide for 
modifying PUFA production in a given host cell. The polypeptide used in a 
particular situation is one which can function under the conditions present in the 
intended host cell but otherwise can be any polypeptide having desaturase 
activity which has the desired characteristic of being capable of modifying the 

1 5 relative production of a desired PUFA. 

For production of ARA, the DNA sequence used encodes a polypeptide 
having AS-desaturase activity. In particular instances, this can be coupled with 
an expression cassette which provides for production of a polypeptide having 
A6-desaturase activity and the host cell can optionally be depleted of any A15- 

20 desaturase activity present, for example by providing a transcription cassette for 
production of antisense sequences to the A15-desaturase transcription product, 
by disrupting the AlS-desaturase gene, or by using a host cell which naturally 
has, or has been mutated to have, low AlS-desaturase activity. Inhibition of 
undesired desaturase pathways also can be accomplished through the use of 

25 specific desaturase inhibitors such as those described in U.S. Patent No. 

4,778,630. The choice of combination of cassettes used can depend in part on 
the PUFA profile of the host cell. Where the host cell A5-desaturase activity is 
limiting, overexpression of AS-desaturase alone generally will be sufficient to 
provide for enhanced ARA production in the presence of an appropriate 

30 substrate such as DGLA. ARA production also can be increased by providing 
expression cassettes for A9- or A12-desaturase genes when the activities of 
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those desaturases are limiting. A scheme for the synthesis of arachidonic acid 
(20:4 A"^' from palmitic acid (Cia) is shown in Figure 1. A key enzyme in 

this pathway is a A5-desatiirase which converts DH-y-Hnolenic acid (DGLA, 
eicosatrienoic acid) to ARA. Conversion of a-linolenic acid (ALA) to 
5 stearidonic acid by a A6-desaturase is also shown. Production of PUFAs in 
addition to ARA, including EPA and DHA is shown in Figure 2. 

SOURCES OF POLYPEPTIDES 
HAVING DESATURASE ACTIVITY 

A source of polypeptides having desaturase activity and oligonucleotides 
1 0 encoding such polypeptides are organisms which produce a desired poly- 
unsatiu'ated fatty acid. As an example, microorganisms having an ability to 
produce ARA can be used as a source of A5-desaturase activity. Such 
microorganisms include, for example, those belonging to the genera 
Mortierella^ Conidiobolus, Pythium, Phytophathora, Penicillium, 
1 5 Porphyridium, Coidosporium, Mucor, Fusarium, Aspergillus, Rhodoiorula, and 
Entomophthora. Within the genus Porphyridium, of particular interest is 
Porphyridium cruenium. Within the genus Mortierella, of particular interest are 
Moriierella elongata, Mortierella exigua, Mortierella hygrophila, Mortierella 
ramanniam^ var. angulispora, and Mortierella alpina. Within the genus Mucor ^ 
20 of particular interest are Mucor circinelloides and Mucor Javanicus. 

DNAs encoding desired desatiu-ases can be identified in a variety of 
ways. As an example, a source of the desired desaturase, for example genomic 
or cDNA libraries from Moriierella, is screened with detectable enzymatically- 
or chemically-synthesized probes, which can be made from DNA, RNA, or non- 

25 naturally occurring niicleotides, or mixtures thereof. Probes may be 

enzymatically synthesized from DNAs of known desaturases for normal or 
reduced-stringency hybridization methods. Oligonucleotide probes also can be 
used to screen sources and can be based on sequences of known desaturases, 
including sequences conserved among known desaturases, or on peptide 

30 sequences obtained from the desired purified protein. Oligonucleotide probes 

based on amino acid sequences can be degenerate to encompass the degeneracy 
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of the genetic code, or can be biased in favor of the preferred codons of the 
source organism. Oligonucleotides also can be used as primers for PGR from 
reverse transcribed mRNA from a known or suspected source; the PGR product 
can be the full length cDNA or can be used to generate a probe to obtain the 
5 desired fiall length cDNA. Alternatively, a desired protein can be entirely 
sequenced and total synthesis of a DNA encoding that polypeptide performed. 

Once the desired genomic or cDNA has been isolated, it can be 
sequenced by known methods. It is recognized in the art that such methods are 
subject to errors, such that multiple sequencing of the same region is routine and 

1 0 is still expected to lead to measurable rates of mistakes in the resulting deduced 
sequence, particularly in regions having repeated domains, extensive secondary 
structure, or unusual base compositions, such as regions with high GG base 
content. When discrepancies arise, resequencing can be done and can employ 
special methods. Special methods can include altering sequencing conditions 

1 5 by using: different temperatures; different enzymes; proteins which alter the 
ability of oligonucleotides to form higher order structures; altered nucleotides 
such as ITP or methylated dGTP; different gel compositions, for example 
adding formamide; different primers or primers located at different distances 
from the problem region; or different templates such as single stranded DNAs. 

20 Sequencing of mRNA also can be employed. 

For the most part, some or all of the coding sequence.for the polypeptide 
having desaturase activity is from a natural sotirce. In some situations, 
however, it is desirable to modify all or a portion of the codons, for example, to 
enhance expression, by employing host preferred codons. Host preferred 

25 codons can be determined from the codons of highest frequency in the proteins 
expressed in the largest amount in a particular host species of interest. Thus, the 
coding sequence for a polypeptide having desaturase activity can be 
synthesized in whole or in part. All or portions of the DNA also can be 
synthesized to remove any destabilizing sequences or regions of secondary 

30 structure which would be present in the transcribed mRNA. All or portions of 
the DNA also can be synthesized to alter the base composition to one more 
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preferable in the desired host ceil. Methods for synthesizing sequences and 
bringing sequences together are well established in the literature. Jn vitro 
mutagenesis and selection, site-directed mutagenesis, or other means can be 
employed to obtain mutations of naturally occurring desaturase genes to 
5 produce a polypeptide having desaturase activity in vivo with more desirable 
physical and kinetic parameters for function in the host cell, such as a longer 
half-life or a higher rate of production of a desired polyunsaturated fatty acid. 

Mortierella alpina Desaturase 

Of particular interest is the Mortierella alpina AS-desaturase which has 

10 446 amino acids; the amino acid sequence is shown in Figure 3. The gene 
encoding the Mortierella alpina A5 -desaturase can be expressed in transgenic 
microorganisms or animals to effect greater synthesis of ARA from DGLA. 
Other DNAs which are substantially identical to the Mortierella alpina AS- 
desaturase DNA, or which encode polypeptides which are substantially identical 

15 to the Mortierella alpina A5-desaturase polypeptide, also can be used. By 
substantially identical is intended an amino acid sequence or nucleic acid 
sequence exhibiting in order of increasing preference at least 60%, 80%, 90% or 
95% homology to the Mortierella alpina A5-desaturase amino acid sequence or 
nucleic acid sequence encoding the amino acid sequence. For polypeptides, the 

20 length of comparison sequences generally is at least 16 amino acids, preferably 
at least 20 amino acids, or most preferably 35 amino acids. For nucleic acids, 
the length of comparison sequences generally is at least 50 nucleotides, 
preferably at least 60 nucleotides, and more preferably at least 75 nucleotides, 
and most preferably, 1 10 nucleotides. Homology typically is measured using 

25 sequence analysis software, for example, the Sequence Analysis software 
package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, Wisconsin 53705, 
MEGAlign (DNAStar, Inc., 1228 S. Park St., Madison, Wisconsin 53715), and 
MacVector (Oxford Molecular Group, 2105 S. Bascom Avenue, Suite 200, 

30 Campbell, California 95008). Such software matches similar sequences by 
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assigning degrees of homology to various substitutions, deletions, and other 
modifications. Conservative substitutions typically include substitutions within 
the following groups: glycine and alanine; valine, isoleucine and leucine; 
aspartic acid, glutamic acid, asparagine, and glutamine; serine and threonine; 
5 lysine and arginine; and phenylalanine and tyrosine. Substitutions may also be 
made on the basis of conserved hydrophobicity or hydrophilicity (Kyte and 
Doolittle, y. Mol Biol 1 57: 1 05- 1 32, 1 982), or on the basis of the ability to 
assume similar polypeptide secondary structure (Chou and Fasman, Adv. 
EnzymoL 47: 45-148, 1978). 

10 Other Desaturases 

Encompassed by the present invention are related desaturases from the 
same or other organisms. Such related desaturases include variants of the 
disclosed A5-desaturase naturally occurring within the same or different species 
of Mortierella, as well as homologues of the disclosed A5-desaturase from other 

1 5 species. Also included are desaturases which, although not substantially 
identical to the Mortierella alpina A5-desaturase, desaturate a fatty acid 
molecule at carbon 5 from the carboxyl end of a fatty acid molecule. Related 
desaturases can be identified by their ability to function substantially the same 
as the disclosed desaturases; that is, are still able to effectively convert DGLA 

20 to ARA. Related desaturases also can be identified by screening sequence 
databases for sequences homologous to the disclosed desaturase, by 
hybridization of a probe based on the disclosed desaturase to a library 
constructed from the source organism, or by RT-PCR using mRNA from the 
source organism and primers based on the disclosed desaturase. Such 

25 desaturases include those from humans, Dictyostelium discoideum and 
Phaeodactylum tricornum. 

The regions of a desaturase polypeptide important for desaturase activity 
can be determined through routine mutagenesis, expression of the resulting 
mutant polypeptides and determination of their activities. Mutants may include 
30 deletions, insertions and point mutations, or combinations thereof. A typical 
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ilinctional analysis begins with deletion mutagenesis to determine the N- and C- 
terminal limits of the protein necessary for function, and then internal deletions, 
insertions or point mutants are made to further determine regions necessary for 
function. Other techniques such as cassette mutagenesis or total synthesis also 

S can be used. Deletion mutagenesis is accomplished, for example, by using 
exonucleases to sequentially remove the 5' or 3' coding regions. Kits are 
available for such techniques. After deletion, the coding region is completed by 
ligating oligonucleotides containing start or stop codons to the deleted coding 
region after 5' or 3' deletion, respectively. Altematively, oligonucleotides 

1 0 encoding start or stop codons are inserted into the coding region by a variety of 
methods including site-directed mutagenesis, mutagenic PCR or by ligation 
onto DNA digested at existing restriction sites. Internal deletions can similarly 
be made through a variety of methods including the use of existing restriction 
sites in the DNA, by use of mutagenic primers via site directed mutagenesis or 

1 S mutagenic PCR. Insertions are made through methods such as linker-scaiming 
mutagenesis, site-directed mutagenesis or mutagenic PCR. Point mutations are 
made through techniques such as site-directed mutagenesis or mutagenic PCR. 

Chemical mutagenesis also can be used for identifying regions of a 
desaturase polypeptide important for activity. A mutated construct is expressed, 

20 and the ability of the resulting altered protein to function as a desaturase is 

assayed. Such structure-function analysis can determine which regions may be 
deleted, which regions tolerate insertions, and which point mutations allow the 
mutant protein to function in substantially the same way as the native 
desaturase. Ail such mutant proteins and nucleotide sequences encoding them 

25 are within the scope of the present invention. 

EXPRESSION OF DESATURASE GENES 

Once the DNA encoding a desaturase polypeptide has been obtained, it 
is placed in a vector capable of replication in a host cell, or is propagated in 
vitro by means of techniques such as PCR or long PCR. Replicating vectors 
30 can include plasmids, phage, viruses, cosmids and the like. Desirable vectors 
include those useful for mutagenesis of the gene of interest or for expression of 
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the gene of interest in host cells. The technique of long PCR has made in vitro 
propagation of large constructs possible, so that modifications to the gene of 
interest, such as mutagenesis or addition of expression signals, and propagation 
of the resulting constructs can occur entirely in vitro without the use of a 
S replicating vector or a host cell. 

For expression of a desaturase polypeptide, functional transcriptional 
and translational initiation and termination regions are operably linked to the 
DNA encoding the desaturase polypeptide. Expression of the polypeptide 
coding region can take place in vitro or in a host cell. Transcriptional and 
10 translational initiation and termination regions are derived from a variety of 
nonexclusive sources, including the DNA to be expressed, genes known or 
suspected to be capable of expression in the desired system, expression vectors, 
chemical synthesis, or from an endogenous locus in a host cell. 

Expression In Vitro 

1 5 In vitro expression can be accomplished, for example, by placing the 

coding region for the desaturase polypeptide in an expression vector designed 
for in vitro use and adding rabbit reticulocyte lysate and cofactors; labeled 
amino acids can be incorporated if desired. Such in vitro expression vectors 
may provide some or all of the expression signals necessary in the system used. 

20 These methods are well known in the art and the components of the system are 
commercially available. The reaction mixture can then be assayed directly for 
the polypeptide, for example by determining its activity, or the synthesized 
polypeptide can be purified and then assayed. 

Expression In A Host Cell 

25 Expression in a host cell can be accomplished in a transient or stable 

fashion. Transient expression can occur from introduced constructs which 

contain expression signals functional in the host cell, but which constructs do 

not replicate and rarely integrate in the host cell, or where the host cell is not 

proliferating. Transient expression also can be accomplished by inducing the 
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activity of a regulatable promoter operably linked to the gene of interest, 
although such inducible systems frequently exhibit a low basal level of 
expression. Stable expression can be achieved by introduction of a construct 
that can integrate into the host genome or that autonomously replicates in the 

5 host cell. Stable expression of the gene of interest can be selected for through 
the use of a selectable marker located on or transfected with the expression 
construct, followed by selection for cells expressing the marker. When stable 
expression results from integration, integration of constructs can occur 
randomly within the host genome or can be targeted through the use of 

10 constructs containing regions of homology with the host genome sufficient to 
target recombination with the host locus. Where constructs are targeted to an 
endogenous locus, all or some of the transcriptional and translational regulatory 
regions can be provided by the endogenous locus. 

When increased expression of the desaturase polypeptide in the source 
IS organism is desired, several methods can be employed. Additional genes 

encoding the desaturase polypeptide can be introduced into the host organism. 
Expression from the native desaturase locus also can be increased through 
. homologous recombination, for example by inserting a stronger promoter into 
the host genome to cause increased expression, by removing destabilizing 
20 sequences from either the mRNA or the encoded protein by deleting that 
information from the host genome, or by adding stabilizing sequences to the 
mRNA (USPN 4,910,141). 

When it is desirable to express more than one different gene, appropriate 
regulatory regions and expression methods, introduced genes can be propagated 

25 in the host cell through use of replicating vectors or by integration into the host 
genome. Where two or more genes are expressed from separate replicating 
vectors, it is desirable that each vector has a different means of replication. 
Each introduced construct, whether integrated or not, should have a different 
means of selection and should lack homology to the other constructs to maintain 

30 stable expression and prevent reassortment of elements among constructs. 
Judicious choices of regulatory regions, selection means and method of 
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propagation of the introduced construct can be experimentally determined so 
that all introduced genes are expressed at the necessary levels to provide for 
synthesis of the desired products. 

As an example, where the host cell is a yeast, transcriptional and 
5 translational regions functional in yeast cells are provided, particularly from the 
host species. The transcriptional initiation regulatory regions can be obtained, 
for example from genes in the glycolytic pathway, such as alcohol 
dehydrogenase, glyceraldehyde-3 -phosphate dehydrogenase (GPD), 
phosphoglucoisomerase, phosphoglycerate kinase, etc. or regulatable genes 

10 such as acid phosphatase, lactase, metallothionein, glucoamylase, etc. Any one 
of a number of regulatory sequences can be used in a particular situation, 
depending upon whether constitutive or induced transcription is desired, the 
particular efficiency of the promoter in conjunction with the open-reading frame 
of interest, the ability to join a strong promoter with a control region from a 

1 5 different promoter which allows for inducible transcription, ease of 

construction, and the like. Of particular interest are promoters which are 
activated in the presence of galactose. Galactose-inducible promoters (GALl, 
GAL7, and GAL 10) have been extensively utilized for high level and regulated 
expression of protein in yeast (Lue et al, Mol Cell Biol Vol. 7, p. 3446, 1987; 

20 Johnston, Microbiol Rev. Vol. 5 1 , p. 458, 1 987). Transcription from the GAL 
promoters is activated by the GAL4 protein, which binds to the promoter region 
and activates transcription when galactose is present. In the absence of 
galactose, the antagonist GAL80 binds to GAL4 and prevents GAL4 from 
activating transcription. Addition of galactose prevents GAL80 from inhibiting 

25 activation by GAL4. 

Nucleotide sequences surrounding the translational initiation codon 
ATG have been found to affect expression in yeast cells. If the desired 
polypeptide is poorly expressed in yeast, the nucleotide sequences of exogenous 
genes can be modified to include an efficient yeast translation initiation 
30 sequence to obtain optimal gene expression. For expression in Saccharomyces, 
this can be done by site-directed mutagenesis of an inefficiently expressed gene 
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by fusing it in-frame to an endogenous Saccharomyces gene, preferably a highly 
expressed gene, such as the lactase gene. 

The termination region can be derived from the 3' region of the gene 
from which the initiation region was obtained or from a different gene. A large 

S number of termination regions are known to and have been foimd to be 

satisfactory in a variety of hosts from the same and different genera and species. 
The termination region usually is selected more as a matter of convenience 
rather than because of any particular property. Preferably, the termination 
region is derived from a yeast gene, particularly Saccharomyces, 

1 0 Schizosaccharomyces, Candida or Kluyveromyces, The 3 ' regions of two 

mammalian genes, y interferon and a2 interferon, are also known to function in 
yeast. 

INTRODUCTION OF CONSTRUCTS INTO HOST CELLS 

Constructs comprising the gene of interest may be introduced into a host 
1 5 cell by standard techniques. These techniques include transformation, 

protoplast frision, lipofection, transfection, transduction, conjugation, infection, 
holistic impact, electroporation, microinjection, scraping, or any other method 
which introduces the gene of interest into the host cell. Methods of 
transformation which are used include lithium acetate transformation (Methods 
20 in Enzymology, Vol. 194, p. 186-187, 1991). For convenience, a host cell which 
has been manipulated by any method to take up a DNA sequence or construct 
will be referred to as "transformed" or "recombinant" herein. 

The subject host will have at least have one copy of the expression 
construct and may have two or more, depending upon whether the gene is 

25 integrated into the genome, amplified, or is present on an extrachromosomal 
element having multiple copy numbers. Where the subject host is a yeast, four 
principal types of yeast plasmid vectors can be used: Yeast Integrating plasmids 
(Yips), Yeast Replicating plasmids (YRps), Yeast Centromere plasmids 
(YCps), and Yeast Episomal plasmids (YEps). Yips lack a yeast replication 

30 origin and must be propagated as integrated elements in the yeast genome. 

YRps have a chromosomal ly derived autonomously replicating sequence and 
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are propagated as medium copy number (20 to 40), autonomously replicating, 
unstably segregating plasmids. YCps have both a replication origin and a 
centromere sequence and propagate as low copy number (10-20), autonomously 
replicating, stably segregating plasmids. YEps have an origin of replication 
5 from the yeast 2^m plasmid and are propagated as high copy number, 

autonomously replicating, irregulariy segregating plasmids. The presence of the 
plasmids in yeast can be ensured by maintaining selection for a marker on the 
plasmid. Of particular interest are the yeast vectors pYES2 (a YEp plasmid 
available from Invitrogen, confers uracil prototrophy and a GALl galactose- 

10 inducible promoter for expression), pRS425-pGl (a YEp plasmid obtained from 
Dr. T. H. Chang, Ass. Professor of Molecular Genetics, Ohio State University, 
containing a constitutive GPD promoter and conferring leucine prototrophy), 
and pYX424 (a YEp plasmid having a constitutive TPl promoter and conferring 
leucine prototrophy; Alber, T. and Kawasaki, G. (1982). J, Mol & AppL 

15 Genetics 1:419). 

The transformed host cell can be identified by selection for a marker 
contained on the introduced construct. Alternatively, a separate marker 
construct may be introduced with the desired construct, as many transformation 
techniques introduce many DNA molecules into host cells. Typically, 

20 transformed hosts are selected for their ability to grow on selective media. 
Selective media may incorporate an antibiotic or lack a factor necessary for 
growth of the untransformed host, such as a nutrient or growth factor. An 
introduced marker gene therefor may confer antibiotic resistance, or encode an 
essential growth factor or enzyme, and permit growth on selective media when 

25 expressed in the transformed host. Selection of a transformed host also can 
occur when the expressed marker protein can be detected, either directly or 
indirectly. The marker protein may be expressed alone or as a fiision to another 
protein. The marker protein can be detected by its enzymatic activity; for 
example P galactosidase can convert the substrate X-gal to a colored product, 

30 and luciferase can convert luciferin to a light-emitting product. The marker 
protein can be detected by its light-producing or modifying characteristics; for 
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example, the green fluorescent protein of Aequorea victoria fluoresces when 
illuminated with blue light. Antibodies can be used to detect the marker 
protein or a molecular tag on, for example, a protein of interest. Cells 
expressing the marker protein or tag can be selected, for example, visually, or 
5 by techniques such as FACS or panning using antibodies. For selection of yeast 
transformants, any marker that functions in yeast may be used. Desirably, 
resistance to kanamycin and the amino glycoside G41 8 are of interest, as well as 
ability to grow on media lacking uracil, leucine, lysine or tryptophan. 

The A5-desaturase-mediated production of PUFAs can be performed in 

1 0 either prokaryotic or eukaryotic host cells. Prokaryotic cells of interest include 
Eschericia^ Bacillus, Lactobacillus, cyanobacteria and the like. Eukaryotic 
cells include mammalian cells such as those of lactating animals, avian cells 
such as of chickens, and other cells amenable to genetic manipulation including 
insect, fungal, and algae cells. The cells may be cultured or formed as part or 

1 5 all of a host organism including an animal. Viruses and bacteriophage also may 
be used with the cells in the production of PUFAs, particularly for gene transfer, 
cellular targeting and selection. In a preferred embodiment, the host is any 
. microorganism or aninial which produces DGLA and/or can assimilate 
exogenously supplied DGLA, and preferably produces large amounts of DGLA. 

20 Examples of host animals include mice, rats, rabbits, chickens, quail, turkeys, 
bovines, sheep, pigs, goats, yaks, etc., which are amenable to genetic 
manipulation and cloning for rapid expansion of the transgene expressing 
population. For animals, a A5-desaturase transgene can be adapted for 
expression in target organelles, tissues and body fluids through modification of 

25 the gene regulatory regions. Of particular interest is the production of PUFAs 
in the breast milk of the host animal. 



Expression In Yeast 

Examples of host microorganisms include Saccharomyces cerevisiae, 
Saccharomyces carlsbergensis, or other yeast such as Candida, Kluyveromyces 
30 or other fungi, for example, filamentous fungi such as Aspergillus, Neurospora, 
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Penicillium, etc. Desirable characteristics of a host microorganism are, for 
example, that it is genetically well characterized, can be used for high level 
expression of the product using ultra-high density fermentation, and is on the 
GRAS (generally recognized as safe) list since the proposed end product is 
5 intended for ingestion by humans. Of particular interest is use of a yeast, more 
particularly baker's yeast (5. cerevisiae), as a cell host in the subject invention. 
Strains of particular interest are SC334 (Mat a pep4-3 prbl-1 122 ura3-52 leu2- 
3, 1 12 regKSOl gall ; Gene 83:57-64, 1989, Hovland P. ei ai\ YTC34 (a ade2- 
101 his3A200 lys2-801 ura3-52; obtained from Dr. T. H. Chang, Ass. Professor 

1 0 of Molecular Genetics, Ohio State University), YTC41 (a/a ura3-52/ura3=52 
Iys2-801/lys2-801 ade2-101/ade2-101 trpl-Al/trpl-Al his3A200/his3A200 
leu2Al/leu2Al; obtained from Dr. T. H. Chang, Ass. Professor of Molecular 
Genetics, Ohio State University), BJ1995 (obtained from the Yeast Genetic 
Stock Centre, 1021 Donner Laboratory, Berkeley, CA 94720), INVSCl (Mat a 

1 5 hiw3Al leu2 trpl -289 ura3-52; obtained from Invitrogen, 1 600 Faraday Ave., 
Carlsbad, CA 92008) and 1NVSC2 (Mat a his3A200 ura3-167; obtained from 
Invitrogen). 

Expression In Avian Species 

For producing PUFAs in avian species and cells, such as chickens, 

20 turkeys, quail and ducks, gene transfer can be performed by introducing a 

nucleic acid sequence encoding a A5-desaturase into the cells following 

procedures known in the art. If a transgenic animal is desired, pluripotent stem 

cells of embryos can be provided with a vector carrying a A5-desaturase 

encoding transgene and developed into adult animal (USPN 5,162,215; Ono et 

25 aL (1 996) Comparative Biochemistry and Physiology A 77i(3):287-292; WO 

9612793; WO 9606160). In most cases, the transgene will be modified to 

express high levels of the desaturase in order to increase production of PUFAs. 

The transgene can be modified, for example, by providing transcriptional and/or 

translational regulatory regions that function in avian cells, such as promoters 

30 which direct expression in particular tissues and egg parts such as yolk. The 
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gene regulatory regions can be obtained from a variety of sources, including 
chicken anemia or avian leukosis viruses or avian genes such as a chicken 
ovalbumin gene. 

Expression In Insect Cells 

5 Production of PUFAs in insect cells can be conducted using baculovirus 

expression vectors harboring a A5-desaturase transgene. Baculovirus 
expression vectors are available from several commercial sources such as 
Clonetech. Methods for producing hybrid and transgenic strains of algae, such 
as marine algae, w^hich contain and express a desaturase transgene also are 

10 provided. For example, transgenic marine algae may be prepared as described 
in USPN 5,426,040. As with the other expression systems described above, the 
timing, extent of expression and activity of the desaturase transgene can be 
regulated by fitting the polypeptide coding sequence with the appropriate 
transcriptional and translational regulatory regions selected for a particular use. 

15 Of particular interest are promoter regions which can be induced under 
preselected growth conditions. For example, introduction of temperature 
sensitive and/or metabolite responsive mutations into the desaturase transgene 
coding sequences, its regulatory regions, and/or the genome of cells into which 
the transgene is introduced can be used for this purpose. 

20 Expression In Plants 

Production of PUFA*s in plants can be conducted using various plant 
transformation systems such as the use of Agrobacterium tumefaciens, plant 
viruses, particle cell transformation and the like which are disclosed in 
Applicant's related applications U.S. Application Serial Nos. 08/834,033 and 
25 08/956,985 and continuation-in-part applications filed simultaneously v«th this 
application all of which are hereby incorporated by reference. 

The transformed host cell is grovm under appropriate conditions adapted 
for a desired end result. For host cells grown in culture, the conditions are 
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typically optimized to produce the greatest or most economical yield of PUFAs, 
which relates to the selected desaturase activity. Media conditions which may 
be optimized include: carbon source, nitrogen source, addition of substrate, 
final concentration of added substrate, form of substrate added, aerobic or 

5 anaerobic growth, growth temperature, inducing agent, induction temperature, 
growth phase at induction, growth phase at harvest, pH, density, and 
maintenance of selection. Microorganisms such as yeast, for example, are 
preferably grovm using selected media of interest, which include yeast peptone 
broth (YPD) and minimal media (contains amino acids, yeast nitrogen base, and 

1 0 ammonium sulfate, and lacks a component for selection, for example uracil). 
Desirably, substrates to be added are first dissolved in ethanol. Where 
necessary, expression of the polypeptide of interest may be induced, for 
example by including or adding galactose to induce expression fi-om a GAL 
promoter. 

15 Expression In An Animal 

Expression in cells of a host animal can likewise be accomplished in a 
transient or stable manner. Transient expression can be accomplished via known 
methods, for example infection or lipofection, and can be repeated in order to 
maintain desired expression levels of the introduced construct (see Ebert, PCT 

20 publication WO 94/05782). Stable expression can be accomplished via 

integration of a construct into the host genome, resulting in a transgenic animal. 
The construct can be introduced, for example, by microinjection of the construct 
into the pronuclei of a fertilized egg, or by transfection, retroviral infection or 
other techniques whereby the construct is introduced into a cell line which may 

25 form or be incorporated into an adult animal (U.S. Patent No. 4,873,191 ; U.S. 
Patent No. 5,530,177; U.S. Patent No. 5,565,362; U.S. Patent No. 5,366,894; 
Wilmut et al (1997) Nature 385:810). The recombinant eggs or embryos are 
transferred to a surrogate mother (U.S. Patent No. 4,873,191 ; U.S. Patent No. 
5,530,177; U.S. Patent No. 5,565,362; U.S. Patent No. 5,366,894; Wilmut et al 

30 (supra)). 
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After birth, transgenic animals are identified, for example, by the 
presence of an introduced marker gene, such as for coat color, or by PGR or 
Southern blotting from a blood, milk or tissue sample to detect the introduced 
construct, or by an immunological or enzymological assay to detect the 
5 expressed protein or the products produced therefrom (U.S. Patent No. 

4,873,191; U.S. Patent No. 5,530,177; U.S. Patent No. 5,565,362; U.S. Patent 
No. 5,366,894; Wilmut et aL (supra)). The resulting transgenic animals may be 
entirely transgenic or may be mosaics, having the transgenes in only a subset of 
their cells. The advent of mammalian cloning, accomplished by fusing a 
10 nucleated cell with an enucleated egg, followed by transfer into a surrogate 

mother, presents the possibility of rapid, large-scale production upon obtaining 
a "founder" animal or cell comprising the introduced construct; prior to this, it 
was necessary for the transgene to be present in the germ line of the animal for 
propagation (Wilmut et aL (supra)). 

1 5 Expression in a host animal presents certain efficiencies, particularly 

where the host is a domesticated animal. For production of PUFAs in a fluid 
readily obtainable from the host animal, such as milk, the desaturase transgene 
can be expressed in mammary cells from a female host, and the PUPA content 
of the host cells altered. The desaturase transgene can be adapted for expression 

20 so that it is retained in the mammary cells, or secreted into milk, to form the 

PUFA reaction products localized to the milk (PCT publication WO 95/24488). 
Expression can be targeted for expression in mammary tissue using specific 
regulatory sequences, such as those of bovine a-lactalbumin, a-casein, P- 
casein, y-casein, K-casein, P-lactoglobulin, or whey acidic protein, and may 

25 optionally include one or more introns and/or secretory signal sequences (U.S. 
Patent No. 5,530,1 77; Rosen, U.S. Patent No. 5,565,362; Clark et aL, U.S. 
Patent No. 5,366,894; Gamer et aL, PCT publication WO 95/23868). 
Expression of desaturase transgenes, or antisense desaturase transcripts, adapted 
in this manner can be used to alter the levels of specific PUFAs, or derivatives 

30 thereof, found in the animals milk. Additionally, the A5-desaturase transgene 
can be expressed either by itself or with other transgenes, in order to produce 
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animal milk containing higher proportions of desired PUFAs or PUFA ratios 
and concentrations that resemble human breast milk (Prieto et aL, PCX 
publication WO 95/24494). 

PURIFICATION OF FATTY ACIDS 

5 The fatty acids desaturated in the A5 position may be found in the host 

microorganism or animal as free fatty acids or in conjugated forms such as 
acylglycerols, phospholipids, sulfolipids or glycolipids, and may be extracted 
from the host cell through a variety of means well-known in the art. Such 
means may include extraction with organic solvents, sonication, supercritical 

1 0 fluid extraction using for example carbon dioxide, and physical means such as 
presses, or combinations thereof. Of particular interest is extraction with 
methanol and chloroform. Where desirable, the aqueous layer can be acidified 
to protonate negatively charged moieties and thereby increase partitioning of 
desired products into the organic layer. After extraction, the organic solvents 

1 5 can be removed by evaporation under a stream of nitrogen. When isolated in 
conjugated forms, the products may be enzymatically or chemically cleaved to 
release the free fatty acid or a less complex conjugate of interest, and can then 
be subject to fiirther manipulations to produce a desired end product. Desirably, 
conjugated forms of fatty acids are cleaved with potassium hydroxide. 

20 If ftuiher purification is necessary, standard methods can be employed. 

Such methods may include extraction, treatment with urea, fractional 
crystallization, HPLC, fractional distillation, silica gel chromatography, high 
speed centrifrigation or distillation, or combinations of these techniques. 
Protection of reactive groups, such as the acid or alkenyl groups, may be done at 

25 any step through known techniques, for example alkylation or iodination. 

Methods used include methylation of the fatty acids to produce methyl esters. 
Similarly, protecting groups may be removed at any step. Desirably, 
purification of fractions containing ARA, DHA and EPA may be accomplished 
by treatment with urea and/or fractional distillation. 
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USES OF FATTY ACIDS 

There are several uses for fatty acids of the subject invention. Probes 
based on the DN As of the present invention may find use in methods for 
isolating related molecules or in methods to detect organisms expressing 
5 desaturases. When used as probes, the DNAs or oligonucleotides must be 
detectable. This is usually accomplished by attaching a label either at an 
internal site, for example via incorporation of a modified residue, or at the 5' or 
3' terminus. Such labels can be directly detectable, can bind to a secondary 
molecule that is detectably labeled, or can bind to an imlabelled secondary 

1 0 molecule and a detectably labeled tertiary molecule; this process can be 
extended as long as is practical to achieve a satisfactorily detectable signal 
without unacceptable levels of backgrotmd signal. Secondary, tertiary, or 
bridging systems can include use of antibodies directed against any other 
molecule, including labels or other antibodies, or can involve any molecules 

1 5 v/hich bind to each other, for example a biotin-streptavidin/avidin system. 
Detectable labels typically include radioactive isotopes, molecules which 
chemically or enzymatically produce or alter light, enzymes which produce 
detectable reaction products, magnetic molecules, fluorescent molecules or 
molecules whose fluorescence or light-emitting characteristics change upon 

20 binding. Examples of labelling methods can be found in USPN 5,0 11, 770. 
Alternatively, the binding of target molecules can be directly detected by 
measuring the change in heat of solution on binding of probe to target via 
isothermal titration calorimetry, or by coating the probe or target on a surface 
and detecting the change in scattering of light from the surface produced by 

25 binding of target or probe, respectively, as may be done vsdth the BIAcore 
system. 

PUFAs produced by recombinant means find applications in a wide 
variety of areas. Supplementation of humans or animals with PUFAs in various 
forms can resuk in increased levels not only of the added PUFAs, but of their 
30 metabolic progeny as well. For example, where the inherent A5-desaturase 
pathway is dysfunctional in an individual, treatment with ARA can result not 
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only in increased levels of ARA, but also of downstream products of ARA such 
as prostaglandins (see Figure 1). Complex regulatory mechanisms can make it 
desirable to combine various PUFAs, or to add different conjugates of PUFAs, 
in order to prevent, control or overcome such mechanisms to achieve the 
5 desired levels of specific PUFAs in an individual. 

NUTRITIONAL COMPOSITIONS 

The present invention also includes nutritional compositions. Such 
compositions, for purposes of the present invention, include any food or 
preparation for human consumption including for enteral or parenteral 
1 0 consumption, which when taken into the body (a) serve to nourish or build up 
tissues or supply energy and/or (b) maintain, restore or support adequate 
nutritional status or metabolic function. 

The nutritional composition of the present invention comprises at least 
one oil or acid produced in accordance with the present invention and may 

15 either be in a solid or liquid form. Additionally, the composition may include 
edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amount of such ingredients will vary depending on whether the 
composition is intended for use with normal, healthy infants, children or adults 
having specialized needs such as those which accompany certain metabolic 

20 conditions (e.g., metabolic disorders). 

Examples of macronutrients which may be added to the composition 
include but are not limited to edible fats, carbohydrates and proteins. Examples 
of such edible fats include but are not limited to coconut oil, soy oil, and mono- 
and diglycerides. Examples of such carbohydrates include but are not limited to 
25 glucose, edible lactose and hydrolyzed search. Additionally, examples of 

proteins which may be utilized in the nutritional composition of the invention 
include but are not limited to soy proteins, electrodialysed whey , 
electrodialysed skim milk, milk whey, or the hydrolysates of these proteins. 

With respect to vitamins and minerals, the following may be added to 
30 the nutritional compositions of the present invention: calcium, phosphorus, 
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potassium, sodium, chloride, magnesium, manganese, iron, copper, zinc, 
selenium, iodine, and Vitamins A, E, D, C, and the B complex. Other such 
vitamins and minerals may also be added. 

The components utilized in the nutritional compositions of the present 
5 invention will of semi-purified or purified origin. By semi-purified or purified 
is meant a material which has been prepared by purification of a natural 
material or by synthesis. 

Examples of nutritional compositions of the present invention include 
but are not limited to infant formulas, dietary supplements, and rehydration 
10 compositions. Nutritional compositions of particular interest include but are not 
limited to those utilized for enteral and parenteral supplementation for infants, 
specialist infant formulae, supplements for the elderly, and supplements for 
those with gastrointestinal difficulties and/or malabsorption. 

Nutritional Compositions 

1 5 A typical nutritional composition of the present invention will contain 

edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amounts of such ingredients will vary depending on whether the 
formulation is intended for use with normal, healthy individuals temporarily 
exposed to stress, or to subjects having specialized needs due to certain chronic 

20 or acute disease states (e.g., metabolic disorders). It will be understood by 
persons skilled in the art that the components utilized in a nutritional 
formulation of the present invention are of semi-purified or purified origin. By 
semi-purified or purified is meant a material that has been prepared by 
purification of a natural material or by synthesis. These techniques are well 

25 known in the art (See, e.g.. Code of Federal Regulations for Food Ingredients 
and Food Processing; Recommended Dietary Allowances, 10^ Ed., National 
Academy Press, Washington, D.C., 1989). 

In a preferred embodiment, a nutritional formulation of the present 
invention is an enteral nutritional product, more preferably an adult or child 
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enteral nutritional product. Accordingly in a further aspect of the invention, a 
nutritional formulation is provided that is suitable for feeding adults, who are 
experiencing stress. The formula comprises, in addition to the PUFAs of the 
invention; macronutrients, vitamins and minerals in amounts designed to 
5 provide the daily nutritional requirements of adults. 

The macronutritional components include edible fats, carbohydrates and 
proteins. Exemplary edible fats are coconut oil, soy oil, and mono- and 
diglycerides and the PUFA oils of this invention. Exemplary carbohydrates are 
glucose, edible lactose and hydrolyzed cornstarch. A typical protein source 
10 would be soy protein, electrodialysed whey or electrodialysed skim milk or milk 
whey, or the hydroly sates of these proteins, although other protein sources are 
also available and may be used. These macronutrients would be added in the 
form of commonly accepted nutritional compounds in amount equivalent to 
those present in human milk or an energy basis, i.e., on a per calorie basis. 

1 S Methods for formulating liquid and enteral nutritional formulas are well 

knovm in the art and are described in detail in the examples. 

The enteral formula can be sterilized and subsequently utilized on a 
ready-to- feed (RTF) basis or stored in a concentrated liquid or a powder. The 
powder can be prepared by spray drying the enteral formula prepared as 

20 indicated above, and the formula can be reconstituted by rehydrating the 

concentrate. Adult and infant nutritional formulas are well known in the art and 
commercially available (e.g., Similac®, Ensure®, Jevity® and Alimentum® 
from Ross Products Division, Abbott Laboratories). An oil or acid of the 
present invention can be added to any of these formulas in the amounts 

25 described below. 

The energy density of the nutritional composition when in liquid form, 
can typically range from about 0.6 Kcal to 3.0 Kcal per ml. When in solid or 
powdered form, the nutritional supplement can contain from about 1 .2 to more 
than 9 Kcals per gm, preferably 3 to 7 Kcals per gram. In general, the 
30 osmolality of a liquid product should be less than 700 mOsm and more 
preferably less than 660 mOsm. 

-37- 



wo 98/46765 PCT/US98/07422 

The nutritional fonnula would typically include vitamins and minerals, 
in addition to the PUFAs of the invention, in order to help the individual ingest 
the minimum daily requirements for these substances. In addition to the PUFAs 
listed above, it may also be desirable to supplement the nutritional composition 
S with zinc, copper, and folic acid in addition to antioxidants. It is believed that 
these substances will also provide a boost to the stressed immune system and 
thus will provide further benefits to the individual. The presence of zinc, 
copper or folic acid is optional and is not required in order to gain the beneficial 
effects on immune suppression. Likewise a pharmaceutical composition can be 
1 0 supplemented with these same substances as well. 

In a more preferred embodiment, the nutritional contains, in addhion to 
the antioxidant system and the PUFA component, a source of carbohydrate 
wherein at least 5 weight % of said carbohydrate is an indigestible 
oligosaccharide. In yet a more preferred embodiment, the nutritional 
15 composition additionally contains protein, taurine and carnitine. 

The PUFAs, or derivatives thereof, made by the disclosed method can 
be used as dietary substitutes, or supplements, particularly infant formulas, for 
patients undergoing intravenous feeding or for preventing or treating 
malnutrition. Typically, human breast milk has a fatty acid profile comprising 

20 from about 0. 1 5 % to about 0.36 % as DHA, from about 0.03 % to about 0. 1 3 % 
as EPA, from about 0.30 % to about 0.88 % as ARA, from about 0.22 % to 
about 0.67 % as DGLA, and from about 0.27 % to about 1.04 % as GLA. 
Additionally, the predominant triglyceride in human milk has been reported to 
be l,3-di-oleoyl-2-palmitoyI, with 2-palmitoyl glycerides reported as better 

25 absorbed than 2-oleoyl or 2-lineoyl glycerides (USPN 4,876,107). Thus, fatty 
acids such as ARA, DGLA, GLA and/or EPA produced by the invention can be 
used to alter the composition of infant formulas to better replicate the PUFA 
composition of human breast milk. In particular, an oil composition for use in a 
pharmacologic or food supplement, particularly a breast milk substitute or 

30 supplement, will preferably comprise one or more of ARA, DGLA and GLA. 
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More preferably the oil will comprise from about 0.3 to 30% ARA, from about 
0.2 to 30% DGLA, and from about 0.2 to about 30% GLA, 

In addition to the concentration, the ratios of ARA, DGLA and GLA can 
be adapted for a particular given end use. When formulated as a breast milk 

5 supplement, or substitute an oil composition which contains two or more of 
ARA, DGLA and GLA will be provided in a ratio of about 1 :19:30 to about 
6:1 :0.2, respectively. For example, the breast milk of animals can vary in ratios 
of ARA:DGLA:DGL ranging from 1:19:30 to 6:1:0,2, which includes 
intermediate ratios which are preferably about 1 : 1 : 1 , 1 :2: 1 , 1 : 1 :4. When 

1 0 produced together in a host cell, adjusting the rate and percent of conversion of 
a precursor substrate such as GLA and DGLA to ARA can be used to precisely 
control the PUFA ratios. For example, a 5% to 10% conversion rate of DGLA 
to ARA can be used to produce an ARA to DGLA ratio of about 1:19, whereas 
a conversion rate of about 75% to 80% can be used to produce an ARA to 

1 5 DGLA ratio of about 6:1. Therefore, whether in a cell culture system or in a 
host animal, regulating the timing, extent and specificity of desaturase 
expression as described can be used to modulate the PUFA levels and ratios. 
Depending on the expression system used, e.g., cell culture or an animal 
expressing oil(s) in its milk, the oils also can be isolated and recombined in the 

20 desired concentrations and ratios. Amounts of oils providing these ratios of 
PUFA can be determined following standard protocols. PUFAs, or host cells 
containing them, also can be used as animal food supplements to alter an 
animal's tissue or milk fatty acid composition to one more desirable for human 
or animal consumption. 

25 For dietary supplementation, the purified PUFAs, or derivatives thereof, 

may be incorporated into cooking oils, fats or margarines formulated so that in 
normal use the recipient would receive the desired amount. The PUFAs may 
also be incorporated into infant formulas, nutritional supplements or other food 
products, and may find use as anti-inflammatory or cholesterol lowering agents. 
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Pharmaceutical Compositions 

The present invention also encompasses a pharmaceutical composition 
comprising one or more of the acids and/or resulting oils produced in 
accordance with the methods described herein. More specifically, such a 
5 pharmaceutical composition may comprise one or more of the acids and/or oils 
as well as a standard, well-known, non-toxic pharmaceutically acceptable 
carrier, adjuvant or vehicle such as, for example, phosphate buffered saline, 
water, ethanol, polyols, vegetable oils, a wetting agent or an emulsion such as a 
water/oil emulsion. The composition may be in either a liquid or solid form. 
1 0 For example, the composition may be in the form of a tablet, capsule, ingestible 
liquid or powder, injectible, or topical ointment or cream. 

Possible routes of administration include, for example, oral, rectal and 
parenteral. The route of administration will, of course, depend upon the desired 
effect. For example, if the composition is being utilized to treat rough, dry, or 
1 5 aging skin, to treat injured or burned skin, or to treat skin or hair affected by a 
disease or condition, it may perhaps be applied topically. 

The dosage of the composition to be administered to the patient may be 
determined by one of ordinary skill in the art and depends upon various factors 
such as weight of the patient, age of the patient, immune status of the patient, 
20 etc. 

With respect to form, the composition may be, for example, a solution, a 
dispersion, a suspension, an emulsion or a sterile powder which is then 
reconstituted. 

Additionally, the composition of the present invention may be utilized 
25 for cosmetic purposes. It may be added to pre-existing cosmetic compositions 
such that a mixture is formed or may be used as a sole composition. 

Pharmaceutical compositions may be utilized to administer the PUFA 

component to an individual. Suitable pharmaceutical compositions may 

comprise physiologically acceptable sterile aqueous or non-aqueous solutions, 

30 dispersions, suspensions or emulsions and sterile powders for reconstitution into 
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Sterile solutions or dispersions for ingestion. Examples of suitable aqueous and 
non-aqueous carriers, diluents, solvents or vehicles include water, ethanol, 
polyols (propyleneglycol, polyethyleneglycol, glycerol, and the like), suitable 
mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters 

5 such as ethyl oleate. Proper fluidity can be maintained, for example, by the 
maintenance of the required particle size in the case of dispersions and by the 
use of siufactants. It may also be desirable to include isotonic agents, for 
example sugars, sodium chloride and the like. Besides such inert diluents, the 
composition can also include adjuvants, such as wetting agents, emulsifying and 

10 suspending agents, sweetening, flavoring and perfuming agents. 

Suspensions, in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 
aluminum metahydroxide, bentonite, agar-agar and tragacanth or mixtures of 
1 S these substances, and the like. 

Solid dosage forms such as tablets and capsules can be prepared using 
techniques well known in the art. For example, PUFAs of the invention can be 
tableted with conventional tablet bases such as lactose, sucrose, and comstarch 
in combination with binders such as acacia, comstarch or gelatin, disintegrating 

20 agents such as potato starch or alginic acid and a lubricant such as stearic acid 
or magnesium stearate. Capsules can be prepared by incorporating these 
excipients into a gelatin capsule along with the antioxidants and the PUFA 
component. The amount of the antioxidants and PUFA component that should 
be incorporated into the pharmaceutical formulation should fit within the 

25 guidelines discussed above. 

As used in this application, the term "treat" refers to either preventing, or 
reducing the incidence of, the undesired occurrence. For example, to treat 
immune suppression refers to either preventing the occurrence of this 
suppression or reducing the amount of such suppression. The terms "patient" 
30 and "individual" are being used interchangeably and both refer to an animal. 
The term "animal" as used in this application refers to any warm-blooded 
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mammal including, but not limited to, dogs, humans, monkeys, and apes. As 
used in the application the term "about" refers to an amount varying from the 
stated range or number by a reasonable amount depending upon the context of 
use. Any numerical number or range specified in the specification should be 
5 considered to be modified by the term about. 

"Dose" and "serving" are used interchangeably and refer to the amount 
of the nutritional or pharmaceutical composition ingested by the patient in a 
single setting and designed to deliver effective amounts of the antioxidants and 
the structured triglyceride. As will be readily apparent to those skilled in the 

10 art, a single dose or serving of the liquid nutritional powder should supply the 
amount of antioxidants and PUF As discussed above. The amount of the dose or 
serving should be a volume that a typical adult can consume in one sitting. This 
amoimt can vary widely depending upon the age, weight, sex or medical 
condition of the patient. However as a general guideline, a single serving or 

1 5 dose of a liquid nutritional produce should be considered as encompassing a 
volume from 100 to 600 ml, more preferably from 125 to 500 ml and most 
preferably from 125 to 300 ml. 

The PUF As of the present invention may also be added to food even 
when supplementation of the diet is not required. For example, the composition 
20 may be added to food of any type including but not limited to margarines, 
modified butters, cheeses, milk, yogurt, chocolate, candy, snacks, salad oils, 
cooking oils, cooking fats, meats, fish and beverages. 

Pharmaceutical Applications 

For pharmaceutical use (human or veterinary), the compositions are 

25 generally administered orally but can be administered by any route by which 

they may be successfully absorbed, e.g., parenterally (i.e. subcutaneously, 

intramuscularly or intravenously), rectally or vaginally or topically, for 

example, as a skin ointment or lotion. The PUF As of the present invention may 

be administered alone or in combination with a pharmaceutically acceptable 

30 carrier or excipient. Where available, gelatin capsules are the preferred form of 

oral administration. Dietary supplementation as set forth above also can 
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provide an oral route of administration. The unsaturated acids of the present 
invention may be administered in conjugated forms, or as salts, esters, amides 
or prodrugs of the fatty acids. Any pharmaceutically acceptable salt is 
encompassed by the present invention; especially preferred are the sodium, 
5 potassium or lithium salts. Also encompassed are the N-alkylpolyhydroxamine 
salts, such as N-methyl glucamine, found in PCX publication WO 96/33155. 
The preferred esters are the ethyl esters. As solid salts, the PUFAs also can be 
administered in tablet form. For intravenous administration, the PUFAs or 
derivatives thereof may be incorporated into commercial formulations such as 

10 Intralipids. The typical normal adult plasma fatty acid profile comprises 6.64 to 
9.46% of ARA, 1.45 to 3.1 1% of DGLA, and 0.02 to 0.08% of GLA. These 
PUFAs or their metabolic precursors can be administered, either alone or in 
mixtures v^ith other PUFAs, to achieve a normal fatty acid profile in a patient. 
Where desired, the individual components of formulations may be individually 

1 5 provided in kit form, for single or multiple use. A typical dosage of a particular 
fatty acid is irom 0. 1 mg to 20 g, or even 1 00 g daily, and is preferably from 1 0 
mg to 1 , 2, 5 or 1 0 g daily as required, or molar equivalent amounts of 
derivative forms thereof. Parenteral nutrition compositions comprising from 
about 2 to about 30 weight percent fatty acids calculated as triglycerides are 

20 encompassed by the present invention; preferred is a composition having from 
about 1 to about 25 weight percent of the total PUFA composition as GLA 
(USPN 5,196,198). Other vitamins, and particularly fat-soluble vitamins such 
as vitamin A, D, E and L-camitine can optionally be included. Where desired, a 
preservative such as a tocopherol may be added, typically at about 0. 1% by 

25 weight. 

Suitable pharmaceutical compositions may comprise physiologically 
acceptable sterile aqueous or non-aqueous solutions, dispersions, suspensions or 
emulsions and sterile powders for reconstitution into sterile injectible solutions 
or dispersions. Examples of suitable aqueous and non-aqeuous carriers, 
30 diluents, solvents or vehicles include water, ethanol, polyols (propylleneglyol, 
polyethylenegycol, glycerol, and the like), suitable mixtures thereof, vegetable 
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oils (such as olive oil) and injectable organic esters such as ehyl oleate. Proper 
fluidity can be maintained, for example, by the maintenance of the required 
particle size in the case of dispersions and by the use of surfactants. It may also 
be desirable to include isotonic agents, for example sugars, sodiimi chloride and 
5 the like. Besides such inert diluents, the composition can also include 
adjuvants, such as wetting agents, emulsifying and suspending agents, 
sweetening, flavoring and perfuming agents. 

Suspensions in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
10 polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 

aluminum metahydroxide, bentonite, agar-agar and tragacanth, or mixtures of 
these substances and the like. 

An especially preferred pharmaceutical composition contains 
diacetyltartaric acid esters of mono- and diglycerides dissolved in an aqueous 

15 medium or solvent. Diacetyltartaric acid esters of mono- and diglycerides have 
an HLB value of about 9-12 and are significantly more hydrophilic than existing 
antimicrobial lipids that have HLB values of 2-4. Those existing hydrophobic 
lipids cannot be formulated into aqueous compositions. As disclosed herein, 
those lipids can now be solubilized into aqueous media in combination with 

20 diacetyltartaric acid esters of mono-and diglycerides. In accordance with this 
embodiment, diacetyltartaric acid esters of mono- and diglycerides (e.g., 
DATEM-C12:0) is melted with other active antimicrobial lipids (e.g., 18:2 and 
12:0 monoglycerides) and mixed to obtain a homogeneous mixture. 
Homogeneity allows for increased antimicrobial activity. The mixture can be 

25 completely dispersed in water. This is not possible without the addition of 

diacetyltartaric acid esters of mono- and diglycerides and premixing with other 
monoglycerides prior to introduction into water. The aqueous composition can 
then be admixed under sterile conditions vsdth physiologically acceptable 
diluents, preservatives, buffers or propellants as may be required to form a spray 

30 or inhalant. 
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The present invention also encompasses the treatment of numerous 
disorders with fatty acids. Supplementation with PUFAs of the present 
invention can be used to treat restenosis after angioplasty. Symptoms of 
inflammation, rheiunatoid arthritis, and asthma and psoriasis can be treated with 
5 the PUFAs of the present invention. Evidence indicates that PUFAs may be 
involved in calcium metabolism, suggesting that PUFAs of the present 
invention may be used in the treatment or prevention of osteoporosis and of 
kidney or urinary tract stones. 

The PUFAs of the present invention can be used in the treatment of 
10 cancer. Malignant cells have been shown to have altered fatty acid 

compositions; addition of fatty acids has been shovsm to slow their growth and 
cause cell death, and to increase their susceptibility to chemotherapeutic agents. 
GLA has been shown to cause reexpression on cancer cells of the E-cadherih 
cellular adhesion molecules, loss of which is associated with aggressive 
1 5 metastasis. Clinical testing of intravenous administration of the water soluble 
lithium salt of GLA to pancreatic cancer patients produced statistically 
significant increases in their survival. PUFA supplementation may also be 
useful for treating cachexia associated with cancer. 

The PUFAs of the present invention can also be used to treat diabetes 
20 (USPN 4,826,877; Horrobin et aL, Am. J. Clin. Nutr, Vol. 57 (Suppl.), 732S- 
73 78). Altered fatty acid metabolism and composition has been demonstrated 
in diabetic animals. These alterations have been suggested to be involved in 
some of the long-term complications resulting from diabetes, including 
retinopathy, neuropathy, nephropathy and reproductive system damage. 
25 Primrose oil, which contains GLA, has been shovm to prevent and reverse 
diabetic nerve damage. 

The PUFAs of the present invention can be used to treat eczema, reduce 
blood pressure and improve math scores. Essential fatty acid deficiency has 
been suggested as being involved in eczema, and studies have shown beneficial 
30 effects on eczema from treatment with GLA. GLA has also been shown to 
reduce increases in blood pressure associated with stress, and to improve 
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performance on arithmetic tests. GLA and DGLA have been shown to inhibit 
platelet aggregation, cause vasodilation, lower cholesterol levels and inhibit 
proliferation of vessel wall smooth muscle and fibrous tissue (Brenner et ai. 
Adv. Exp. Med. Biol. Vol. 83, p. 85-101, 1976). Administration of GLA or 
5 DGLA, alone or in combination with EPA, has been shown to reduce or prevent 
gastro-intestinal bleeding and other side effects caused by non-steroidal anti- 
inflammatory drugs (USPN 4,666,701). GLA and DGLA have also been shown 
to prevent or treat endometriosis and premenstrual syndrome (USPN 4,758,592) 
and to treat myalgic encephalomyelitis and chronic fatigue after viral infections 
10 (USPN 5,116,871). 

Further uses of the PUFAs of this invention include use in treatment of 
AIDS, multiple schlerosis, acute respiratory syndrome, hypertension and 
inflammatory skin disorders. The PUFAs of the inventions also can be used for 
formulas for general health as well as for geriatric treatments. 

15 Veterinary Applications 

It should be noted that the above-described pharmaceutical and 
nutritional compositions may be utilized in connection with animals, as well as 
humans, as animals experience many of the same needs and conditions as 
human. For example, the oil or acids of the present invention may be utilized in 
20 animal feed supplements. 

The following examples are presented by way of illustration, not of 
limitation. 

Examples 

Example 1 Isolation of a AS^lesattirase Nucleotide Sequence from 
25 Mortierella alpina 

Example 2 Expression of M alpina A5-desaturase Clones in Baker's 

Yeast 

Example 3 Initial Optimization of Culture Conditions 
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Example 4 Distribution of PUF As in Yeast Lipid Fractions 
Example 5 Further Culture Optimization 

Example 6 Identification of Homologues to M alpina A5 and A6 

desaturases 

5 Example 7 Identification of M alpina A5 and A6 homologues in 

other PUFA-producing organisms 

Example 8 Identification of M alpina A5 and A6 homologues in 

other PUFA-producing organisms 

Example 9 Human Desaturase Sequences 

1 0 Example 1 0 Nutritional Compositions 

Example 1 

Isolation of a AS-dcsaturase Nucleotide Sequence from Mortierella alpina 

Motierella alpina produces arachidonic acid (ARA, 20:4) from the 
precursor 20:3 by a A5-desaturase. A nucleotide sequence encoding the A5- 
1 5 desaturase fi-om Mortierella alpina was obtained through PCR amplification 
using M alpina 1 strand cDN A and degenerate oligonucleotide primers 
corresponding to amino acid sequences conserved between A6-desaturases from 
Synechocystis and Spirulina. The procedure used was as follows: 

Total RNA was isolated fi*om a 3 day old PUFA-producing culture of 
20 Mortierella alpina using the protocol of Hoge et al ( 1 982) Experimental 
Mycology 6:225-232. The RNA was used to prepare double-stranded cDNA 
using BRL's lambda-ZipLox system, following the manufacturer's instructions. 
Several size fi'actions of the M alpina cDNA were packaged separately to yield 
libraries with different average-sized inserts. The "fiill-liength" library contains 
25 approximately 3x10^ clones with an average insert size of 1 J7 kb. The 
"sequencing-grade" library contains approximately 6x10^ clones with an 
average insert size of 1 . 1 kb. 
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5|ag of total RNA was reverse transcribed using BRL Superscript RTase 
and the primer TSyn (5'-CCAAGCTTCTGCAGGAGCTCTTTTTTT 
TTTTTTTT-3'), SEQ ID NO: 10. Degenerate oligonucleotides were designed 
to regions conserved between the two cyanobacterial A6-desaturase sequences. 
5 The specific primers used were DeOESAT-FS (SEQ ID NO: 8) (5'- 

CUACUACUACUACAYCAYAC0TAYAC0AAYAT.3') and D6DESAT-R3 
(SEQ ID N0:9) (5^CAUCAUCAUCAUOGGRAAOARRTGRTG-3'), where 
Y=C+T, R=A+G, and 0=I+C. PGR amplification was carried out in a 25^1 
volume containing: template derived from 40 ng total RNA, 2 pM each primer, 

10 200 ^M each deoxyribonucleotide triphosphate, 60 mM Tris-Cl, pH 8.5, 15 mM 
(NH4)2S04, 2 mM MgCh. Samples were subjected to an initial denaturation 
step of 95 degrees (all temperatures Celsius) for 5 minutes, then held at 72 
degrees while 0.2 UofTaq polymerase were added. PGR thermocycling 
conditions were as follows: 94 degrees for 1 min., 45 degrees for 1 .5 min,, 72 

1 5 degrees for 2 min. PGR was continued for 35 cycles. PGR using these primers 
on the M. alpina first-strand cDNA produced a 550 bp reaction product. 
. Comparison of the deduced amino acid sequence of the M alpina PGR 
fragment SEQ ID N0:3 revealed regions of homology with A6-desaturases (see 
Figure 5). However, there was only about 28% identity over the region 

20 compared. 

The PGR product was used as a probe to isolate corresponding cDNA 
clones from a M alpina library. The longest cDNA clone, Ma29, was 
designated pGGN5521 and has been completely sequenced on both strands. 
The cDNA is contained as a 1481 bp insert in the vector pZLl (Bethesda 

25 Research Laboratories) and, beginning with the first ATG, contains an open 
reading frame encoding 446 amino acids. The reading frame contains the 
sequence deduced from the PGR fragment. The sequence of the cDNA insert 
was found to contain regions of homology to A6-desaturases (see Figure 5). For 
example, three conserved "histidine boxes" (that have been observed in 

30 membrane-bound desaturases (Okuley et al., (1994) The Plant Cell 6:147-158)) 
were found to be present in the Mortierella sequence at amino acid positions 
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171-175, 207-212, and 387-391 (see Figure 3). However, the typical 
"HXXHH" amino acid motif for the third histidine box for the Mortierella 
desaturase was found to be QXXHH, SEQ ID NO: 1 1-12. Surprisingly, the 
amino-terminus of the encoded protein, showed significant homology to 

5 cytochrome b5 proteins. Thus, the Mortierella cDN A clone appears to 

represent a fusion between a cytochrome b5 and a fatty acid desaturase. Since 
cytochrome b5 is believed to function as the electron donor for membrane- 
bound desaturase enzymes, it is possible that the N-terminal cytochrome b5 
domain of this desaturase protein is involved in its fimction. This may be 

10 advantageous when expressing the desaturase in heterologous systems for 
PUFA production. 

Example 2 

Expression of M. alpina Desaturase Clones in Baker^s Yeast 

Yeast Transformation 

1 5 Lithium acetate transformation of yeast was performed according to 

standard protocols {Methods in Enzymology, Vol. 194, p. 186-187, 1991). 
Briefly, yeast were grown in YPD at 30**C. Cells were spun down, resuspended 
in TE, spun down again, resuspended in TE containing 100 mM lithium acetate, 
spun down again, and resuspended in TE/lithium acetate. The resuspended 

20 yeast were incubated at 30*'C for 60 minutes with shaking. Carrier DNA was 
added, and the yeast were aliquoted into tubes. Transforming DNA was added, 
and the tubes were incubated for 30 min. at 30X. PEG solution (35% (w/v) 
PEG 4000, 100 mM lithium acetate, TE pH7.5) was added followed by a 50 
min. incubation at 30*'C. A 5 min. heat shock at 42'*C was performed, the cells 

25 were pelleted, washed with TE, pelleted again and resuspended in TE. The 
resuspended cells were then plated on selective media. 

Desaturase Expression in Transformed Yeast 

The cDNA clones from Mortierella alpina were screened for desaturase 
activity in baker's yeast. A canola Al 5-desaturase (obtained by PCR using 1'^ 
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Strand cDNA from Brassica napus cultivar 212/86 seeds using primers based on 
the published sequence (Axondel et al Science 258:1353-1355)) was used as a 
positive control. The A15-desaturase gene and the gene from cDNA clone 
Ma29 was inserted into the expression vector pYES2 (Invitrogen), resulting in 

5 plasmids pCGR-2 and pCGR-4, respectively. These plasmids were transfected 
into 5. cerevisiae yeast strain 334 and expressed after induction with galactose 
and in the presence of substrates that allowed detection of specific desaturase 
activity* The control strain was S. cerevisiae strain 334 containing the unaltered 
pYES2 vector. The substrates used, the products produced and the indicated 

1 0 desaturase activity were: DGLA (conversion to ARA would indicate A5- 
desaturase activity), linolenic acid (conversion to OLA would indicate A6- 
desaturase activity; conversion to ALA would indicate A15-desaturase activity), 
oleic acid (an endogenous substrate made by S. cerevisiae, conversion to ' 
linolenic acid would indicate A12-desaturase activity, which S, cerevisiae 

1 5 lacks), or ARA (conversion to EPA would indicate Al 7-desaturase activity). 
The results are provided in Table 1 below. The lipid fractions were extracted as 
follows: Cultures were grown for 48-52 hours at 1 5**C. Cells were pelleted by 
centrifrigation, washed once with sterile ddHjO, and repelleted. Pellets were 
vortexed with methanol; chloroform was added along with tritridecanoin (as an 

20 internal standard). The mixtures were incubated for at least one hour at room 
temperature or at 4°C overnight. The chloroform layer was extracted and 
filtered through a Whatman filter with one gram of anhydrous sodium sulfate to 
remove particulates and residual water. The organic solvents were evaporated 
at 40''C under a stream of nitrogen. The extracted lipids were then derivatized 

25 to fatty acid methyl esters (FAME) for gas chromatography analysis (GC) by 
adding 2 ml of 0.5 N potassium hydroxide in methanol to a closed tube. The 
samples were heated to 95**C to lOO^'C for 30 minutes and cooled to room 
temperature. Approximately 2 ml of 14 % boron trifluoride in methanol was 
added and the heating repeated. After the extracted lipid mixture cooled, 2 ml 

30 of water and 1 ml of hexane were added to extract the FAME for analysis by 
GC. The percent conversion was calculated by dividing the product produced 

-50- 



98/46765 



PCT/US98/07422 



by the sum of (the product produced and the substrate added) and then 
muhiplying by 100. To calculate the oleic acid percent conversion, as no 
substrate was added, the total linolenic acid produced was divided by the sum of 
(oleic acid and linolenic acid produced), then multiplying by 1 00. 
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Table 1 

M. alpina Desaturase Expression in Baker's Yeast 

CLONE TYPE OF ENZYME % CONVERSION 

ACTIVITY OF SUBSTRATE 

pCGR-2 A6 0 (18:2 to 18:3co6) 

(canolaAlS A15 16.3 (18:2 to 18:3co3) 

desaturase) A5 2.0 (20:3 to 20:4a>6) 

A17 2.8 (20:4 to 20:50)3) 

A12 1.8 (18:ltol8:2(o6) 

pCGR-4 A6 0 

(M alpina A15 0 

Ma29) A5 15.3 

A17 0.3 

A12 3.3 



5 The A15-desaturase control clone exhibited 16.3% conversion of the 

substrate. The pCGR-4 clone expressing the Ma29 cDNA converted 15.3% of 
the 20:3 substrate to 20:4co6, indicating that the gene encodes a A5 -desaturase. 
The background (non-specific conversion of substrate) was between 0-3% in 
these cases. We also found substrate inhibition of the activity by using 

1 0 different concentrations of the substrate. When substrate was added to 100 ^M, 
the percent conversion to product dropped compared to when substrate was 
added to 25 ^M (see below). Additionally, by varying the DGLA substrate 
concentrations, between about 5 jiM to about 200 fiM percent conversion of 
DGLA to ARA ranged from about 5% to 75% with the M. alpina A5- 

1 5 desaturase. 
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These data show that desaturases with different substrate specificities 
can be expressed in a heterologous system and used to produce poly-unsaturated 
long chain fatty acids. 

Table 2 represents fatty acids of interest as a percent of the total lipid 
5 extracted from the yeast host S, cerevisiae 334 with the indicated plasmid. No 
glucose was present in the growth media. Affinity gas chromatography was 
used to separate the respective lipids. GC/MS was employed to verify the 
identity product(s). The expected product for the B, napus A15-desaturase, a- 
linolenic acid, was detected when its substrate, linolenic acid, was added 

1 0 exogenously to the induced yeast culture. This finding demonstrates that yeast 
expression of a desaturase gene can produce fiinctionai enzyme and detectable 
amounts of product under the current growth conditions. Both exogenously 
added substrates were taken up by yeast, although slightly less of the longer 
chain PUFA, dihomo-y-linolenic acid (20:3), was incorporated into yeast than 

1 5 linolenic acid (1 8:2) when either was added in free form to the induced yeast 

cultures. Arachidonic acid was detected as a novel PUFA in yeast when 

dihomo-y-linolenic acid was added as the substrate to S. cerevisiae 334 (pCGR- 

4). This identifies pCGR-4 (MA29) as the A5-desaturase fi-om M alpina. Prior 

to this, no isolation and expression of a A5-desaturase from any source has been 
20 reported. 
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Example 3 
Optimization of Culture Conditions 

Table 3 A shows the effect of exogenous free fatty acid substrate 
concentration on yeast uptake and conversion to fatty acid product as a 

5 percentage of the total yeast lipid extracted. In all instances, low amounts of 
exogenous substrate (1-10 nM) resulted in low fatty acid substrate uptake and 
product formation. Between 25 and 50 |iM concentration of free fatty acid in 
the growth and induction media gave the highest percentage of fatty acid 
product formed, while the 1 00 (iM concentration and subsequent high uptake 

1 0 into yeast appeared to decrease or inhibit the desaturase activity. The feedback 
inhibition of high fatty acid substrate concentration was well illustrated when 
the percent conversion rates of the respective fatty acid substrates to their 
respective products were compared in Table 3B. In all cases, 100 ^iM substrate 
concentration in the growth media decreased the percent conversion to product. 

15 The effect of media composition was also evident when glucose was present in 

the growth media for the A5 -desaturase, since the percent of substrate uptake 
was decreased at 25 ^M (Table 3A). However, the percent conversion by A5- 
desaturase increased by 1 8% and the percent product formed remained the same 
in the presence of glucose in the growth media. 
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Table 3A 

EfTect of Added Substrate on the Percentage of Incorporated 
Substrate and Product Formed in Yeast Extracts 



Plasmid 


pCGR-2 


pCGR-4 


in Yeast 


(A15) 


(A5) 


substrate/product 


18:2 /a- 18:3 


20:3/20:4 


i sub. 


ND 


0.5/1.7 


\0\iM sub. 


ND 


3.3/4 


25 ^ M sub. 


ND 


5.1/6.1 


25 ^MO sub. 


36.6/7.20 


9.3/5.40 


50 sub. 


53.1/6.50 


ND 


100 sub. 


60.1/5.70 


32.3/5.80 



5 Table 3B 

EfTect of Substrate Concentration in Media on the Percent Conversion 
of Fatty Acid Substrate to Product in Yeast Extracts 



Plasmid in Yeast 


pCGR-2 
(A15) 


pCGR-4 

(A5) 


substrate/product 


18:2 -►a-lSrS 


20:3-^20:4 


1 sub. 


ND 


77.3 


10 sub. 


ND 


54.8 


25 sub. ' 


ND 


54.2 


25 ^MO sub. 


16.4 


36.7 


50 |iM sub. 


10.90 


ND 


100 ^iM sub. 


8.70 


15.20 



0 no glucose in media 

* Yeast peptone broth (YPD) 

* 18:1 is an endogenous yeast lipid 
sub. is substrate concentration 
ND (not done) 



Table 4 shows the amount of fatty acid produced by a recombinant 
desaturase from induced yeast cultures when different amounts of free fatty acid 
substrate were used. Fatty acid weight was determined since the total amount of 
lipid varied dramatically when the growth conditions were changed, such as the 
presence of glucose in the yeast growth and induction media. To better 
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determine the conditions when the recombinant desaturase would produce the 
most PUFA product, the quantity of individual fatty acids were examined. The 
absence of glucose reduced the amount of arachidonic acid produced by A5- 
desaturase by half. For A5-desaturase the amount of total yeast lipid was 
S decreased by almost half in the absence of glucose. 



Table 4 

Fatty Acid Produced in jig from Yeast Extracts 



Plasmid in Yeast 


pCGR-4 


pCGR-7 


(enzyme) 


(A5) 




(A12) 


product 


20:4 


18:2* 


1 sub. 


8.3 


ND 


10 sub. 


19.2 


ND 


25 sub. 


31.2 


115.7 


25 0 sub. 


16.8 


39 0 



0 no glucose in media 
1 0 sub. is substrate concentration 

ND (not done) 
18:1, the substrate, is an endogenous yeast lipid 

Example 4 

Distribution of PUFAs in Yeast Lipid Fractions 

1 5 Table 5 illustrates the uptake of free fatty acids and their new products 

formed in yeast lipids as distributed in the major lipid fractions. A total lipid 
extract was prepared as described above. The lipid extract was separated on 
TLC plates, and the fractions were identified by comparison to standards. The 
bands were collected by scraping, and internal standards were added. The 

20 fractions were then saponified and methylated as above, and subjected to gas 

chromatography. The gas chromatograph calculated the amount of fatty acid by 
comparison to a standard. It would appear that the substrates are accessible in 
the phospholipid form to the desaturases. 
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Table 5 

Fatty Acid Distribution in Various Yeast Lipid Fractions in 



Fatty acid 
fraction 


Phospholipid 


Diglyceride 


Free Fatty 
Acid 


Triglyceride 


Cholesterol 
Ester 


SC (pCGR-4) 
substrate 20:3 


15.1 


1.9 


22.9 


12.6 


3.3 


SC (pCGR.4) 
product 20:4 


42.6 


0.9 


6.8 


4.9 


0.4 



SC = S. cerevisiae (plasmid) 



5 Example 5 

Further Culture Optimization 

The growth and induction conditions for optimal activities of 
desaturases in Saccharomyces cerevisiae were evaluated. Various culture 
conditions that were manipulated for optimal activity were: I) induction 
1 0 temperature, ii) concentration of inducer, iii) timing of substrate addition, iv) 
concentration of substance, v) sugar source, vi) growth phase at induction. 
These studies were done using A5-desaturase gene from Mortierella alpina 
(MA 29). In addition, the effect of changing host strain on expression of the 
AS-desaturase gene was also determined. 

1 5 As described above, the best rate of conversion of substrate to ARA was 

observed at a substrate concentration of 1 jiM, however, the percentage of ARA 
in the total fatty acids was highest at 25 \xM substrate concentration. To 
determine if the substrate needed to be modified to a readily available form 
before it could be utilized by the desaturase, the substrate was added either 1 5 

20 hours before induction or concomitant with inducer addition (indicated as after, 

in Figure 6A). As it can be seen in Figure 6A, addition of substrate before 
induction did not have a significant effect on the activity of A5-desaturase. In 
fact, addition of substrate along with the inducer was slightly better for 
expression/activity of A5-desaturase, as ARA levels in the total fatty acids were 
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higher. However, the rate of conversion of substrate to product was slightly 
lower. 

The effect of inducer concentration of expression/activity of Moriierella 
A5-desaturase was examined by inducing SC334/pCGR5 with 0.5 or 2% (w/v) 
5 of galactose. As shovm in Figures 7A and 7B, expression of A5-desaturase was 

higher when induced with 0.5% galactose. Furthermore, rate of conversion of 
substrate to product was also better when SC334/pCGR5 was induced with 
0.5% galactose vs 2% galactose. 

To determine the effect of temperature on A5-desaturase activity, the 
1 0 SC334 host strain, transformed with pCGR5 (SC334/pCGR5) was grown and 

induced at 15° C, 25 X, 30°C and 37*^0. The quantity of ARA (20:4n6) 
produced in SC334/pCGR5 cultures, supplemental vrfth substrate 20:3n6, was 
measured by fatty acid analysis. Figure 8A depicts the quantity of 20:3n6 and 
20:4n6, expressed as percentage of total fatty acids. Figure 8B depicts the rate 
1 5 of conversion of substrate to product. Growth and induction of SC334/pCGR5 

at 25 °C, was the best for the expression of A5-desaturase as evidenced by the 
highest levels of arachidonic acid in the total fatty acids. Additionally the 
highest rate of conversion of substrate to product also occurred at 25**C. 
Growth and induction at 15**C gave the lowest expression of ARA, whereas at 
20 37°C gave the lowest conversion of substrate to product. 

The effect of yeast strain on expression of the A5-desaturase gene was 
studied in 5 different host strains; INVSCl, INVSC2, YTC34, YTC41, and 
SC334, at 15°C and 30**C. At 15X, SC334 has the highest percentage of ARA 
in total fatty acids, suggesting higher activity of A5-desaturase in SC334. The 
25 rate of conversion of substrate to product, however is lowest in SC334 and 

highest in INVSCl (Fig. 9A and B). At 30*'C, the highest percentage of product 
(ARA) in total fatty acids was observed in INVSC2, although the rate of 
conversion of substrate to product in INVSC2 was slightly lower than INVSCl 
(Fig. lOAandB). 
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ARA, the product of A5-desaturase, is stored in the phospholipid faction 
(Example 4). Therefore the quantity of ARA produced in yeast is limited by the 
amount that can be stored in the phospholipid fraction. If ARA could also be 
stored in other fractions such as the triglyceride fraction, the quantity of ARA 
5 produced in yeast might be increased. To test this hypothesis, the A5-desaturase 
gene was expressed in the yeast host strain DBY746 (obtained from the Yeast 
Genetic Stock Centre, 1021 Donner Laboratory, Berkeley, CA 94720. The 
genotype of strain DBY746 is Mata, his3-Al, leu2-3, leu2-l 12, ura3-32, trpl- 
289, gal). The DBY746 yeast strain has an endogenous gene for choline 
1 0 transferase. The presence of this enzyme might enable the DB Y746 strain to 
convert excess phospholipids into triglycerides fraction. Results in Fig. 1 1 
show no increase in the conversion of substrate to product as compared to 
SC334, which does not have the gene for choline transferase. 

To study the effect of media on expression of A5-desaturase, 
1 5 pCGR4/SC334 was grown in four different media at two different temperatures 

(15^C and 30^) and in two different host strains (SC334 and INVSCl). The 
composition of the media was as follows: 

Media A: mm-Ura, + 2% galactose + 2% glucose. 

Media B: mm-Ura, + 20% galactose + 2% Glucose + IM sorbitol (pH5.8) 

20 Media C: mm-Ura, + 2% galactose + 2% raffinose 

Media D: mm-Ura, + 2% galactose +2% raffinose + IM sorbitol (pH5.8) 

mm^minimal media 

Results show that the highest conversion rate of substrate to product at 
IS'^C in SC334 was observed in media A. The highest conversion rate overall 
25 for A5-desaturase in SC334 was at 30** in media D. The highest conversion rate 

of A5-desaturase in INVSCl was also at 30** in media D (Figures 12A and 12B). 

These data show that a DNA encoding a desaturase that can convert 
DGLA to ARA can be isolated from Moriierella alpina and can be expressed in 
a heterologous system and used to produce poly-unsaturated long chain fatty 
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acids. Exemplified is the production of ARA from the precursor DGLA by 
expression of a A5-desaturase in yeast. 

Example 6 

Identification of Homolo^ues to M. giving AS and A6 desaturascs 

5 A nucleic acid sequence that encodes a putative A5 desaturase was 

identified through a TBLASTN search of the est databases through NCBI using 
amino acids 100-446 of Ma29 as a query. The truncated portion of the Ma29 
sequence was used to avoid picking up homologies based on the cytochrome b5 
portion at the N-terminus of the desaturase. The deduced amino acid sequence 
10 of an est from Dictyostelium discoideum (accession # C25549) shows very 
significant homology to Ma29 and lesser, but still significant homology to 
Ma524. The DNA sequence is presented as SEQ ID NO: 13. The amino acid 
sequence is presented as SEQ ID NO: 14. 

Example 7 

15 Identification of M. glpina AS and A6 homolopues in other 

PUFA-producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 
was constructed fh)m total RNA isolated from Phaeodactylum tricomutum. A 
plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 
20 following manufacturer's instructions using a commercially available kit 
(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 
sequences that encode putative A5 or A6 desaturases were identified through 
BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 

One clone was identified from the Phaeodactylum library with 
25 homology to Ma29 and Ma524; it is called 144-01 1-B12. The DNA sequence is 

presented as SEQ ID NO: 15. The amino acid sequence is presented as SEQ ID 
NO: 16. 
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Example 8 

Identification of M. alpina AS and A6 homologues in other 

PUFA-producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 
5 was constructed from total RNA isolated from Schizochytrium species. A 

plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 
following manufacturer's instructions using a commercially available kit 
(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 
sequences that encode putative A5 or A6 desaturases were identified through 
10 BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 

One clone was identified from the Schizochytrium library with 
homology to Ma29 and Ma524; it is called 81-23-C7. This clone contains a ~1 
kb insert. Partial sequence was obtained from each end of the clone using the 
universal forward and reverse sequencing primers. The DNA sequence from 
15 the forward primer is presented as SEQ ID NO: 17. The peptide sequence is 

presented as SEQ ID NO: 18. The DNA sequence from the reverse primer is 
presented as SEQ ID NO: 19. The amino acid sequence from the reverse primer 
is presented as SEQ ID NO:20. 

Example 9 

20 Human Desaturase Gene SeQuences 

Human desaturase gene sequences potentially involved in long chain 
polyunsaturated fatty acid biosynthesis were isolated based on homology 
between the hiunan cDNA sequences and Mortierella alpina desaturase gene 
sequences. The three conserved "histidine boxes" known to be conserved 
25 among membrane-bound desaturases were found. As with some other 

membrane-bound desaturases the final HXXHH histidine box motif was found 
to be QXXHH. The amino acid sequence of the putative human desaturases 
exhibited homology to M alpina A5, A6, A9, and A 12 desaturases. 
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The M. alpina A5 desaturase and A6 desaturase cDNA sequences were 
used to search the LifeSeq database of Incyte Pharmaceuticals, Inc., Palo Alto, 
California 94304. The A5 desaturase sequence was divided into fragments; 1) 
amino acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301- 

5 446. The A6 desaturase sequence was divided into three fragments; 1) amino 
acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301-457. 
These polypeptide fragments were searched against the database using the 
"tblastn" algorithm. This alogarithm compares a protein query sequence against 
a nucleotide sequence database dynamically translated in all six reading frames 

1 0 (both strands). 

The polypeptide fragments 2 and 3 of M. alpina A5 and A6 have 
homologies with the ClonelD sequences as outlined in Table 6. The ClonelD 
represents an individual sequence from the Incyte LifeSeq database. After the 
"tblastn" results have been reviewed, Clone Information was searched with the 

1 5 default settings of Stringency of >=50, and Productscore <= 1 00 for different 

ClonelD numbers. The Clone Information Results displayed the information 
including the ClusterlD, ClonelD, Library, HitID, Hit Description. When 
selected, the ClusterlD number displayed the clone information of all the clones 
that belong in that ClusterlD. The Assemble command assembles all of the 

20 ClonelD which comprise the ClusterlD. The following default settings were 

used for GCG (Genetics Computer Group, University of Wisconsin 
Biotechnology Center, Madison, Wisconsin 53705) Assembly: 





Word Size: 


7 


25 


Minimum Overlap: 


14 




Stringency: 


0.8 




Minimum Identity: 


14 




Maximum Gap: 


10 




Gap Weight: 


8 


30 


Length Weight: 


2 
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GCG Assembly Results displayed the contigs generated on the basis of 
sequence information within the ClonelD. A contig is an alignment of DNA 
sequences based on areas of homology among these sequences. A new 
5 sequence (consensus sequence) was generated based on the aligned DNA 

sequences within a contig. The contig containing the ClonelD was identified, 
and the ambiguous sites of the consensus sequence was edited based on the 
alignment of the ClonelDs (see SEQ ID N0:21 - SEQ ID NO:25) to generate 
the best possible sequence. The procedure was repeated for all six ClonelD 

1 0 listed in Table 6. This produced five unique contigs. The edited consensus 

sequences of the 5 contigs were imported into the Sequencher software program 
(Gene Codes Corporation, Ann Arbor, Michigan 48 105). These consensus 
sequences were assembled. The contig 25 1 1 785 overlaps with contig 3506132, 
and this new contig was called 2535 (SEQ ID NO:27). The contigs from the 

1 5 Sequencher program were copied into the Sequence Analysis software package 

of GCG. 

Each contig was translated in all six reading frames into protein 
sequences. The Af. alpina A5 (MA29) and A6 (MA524) sequences were 
compared with each of the translated contigs using the FastA search (a Pearson 

20 and Lipman search for similarity between a query sequence and a group of 

sequences of the same type (nucleic acid or protein)). Homology among these 
sequences suggest the open reading frames of each contig. The homology 
among the M alpim A5 and A6 to contigs 2535 and 3854933 were utilized to 
create the final contig called 253538a. Figure 13 is the FastA match of the final 

25 contig 253538a and MA29, and Figure 14 is the FastA match of the final contig 
253538a and MA524. The DNA sequences for the various contigs are 
presented in SEQ ID N0:21 -SEQ ID NO:27. The various peptide sequences 
are shown in SEQ ID NO:28 - SEQ ID NO:34. 

Although the open reading frame was generated by merging the two 
30 contigs, the contig 2535 shows that there is a unique sequence in the beginning 
of this contig which does not match with the contig 3854933. Therefore, it is 
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possible that these contigs were generated from independent desaturase like 
human genes. 

The contig 253538a contains an open reading frame encoding 432 
amino acids. It starts with Gin (CAG) and ends with the stop codon (TGA). 
5 The contig 253538a aligns with both M alpina A5 and A6 sequences, 

suggesting that it could be either of the desaturases, as well as other known 
desaturases which share homology with each other. The individual contigs 
listed in Table 6, as well as the intermediate contig 2535 and the final contig 
253538a can be utilized to isolate the complete genes for human desaturases. 

1 0 Uses of the human desaturases 

These human sequences can be expressed in yeast and plants utilizing 
the procedures described in the preceding examples. For expression in 
mammalian cells and transgenic animals, these genes may provide superior 
codon bias. These human sequences can also be used to identify related 
15 desaturase sequences. 



Table 6 



Sections of the 
Desaturases 


Clone ID from LifeSeq Database 


Keyword 


151-300 A5 


3808675 


Fatty acid desaturase 


301-446 A5 


354535 


A6 


151-300 A6 


3448789 


A6 


151-300 A6 


1362863 


A6 


151-300 A6 


2394760 


A6 


301-457 A6 


3350263 


A6 



Example 10 

Nutritional Compositions 

20 The PUFAs of the previous examples can be utilized in various 

nutritional supplements, infant formulations, nutritional substitutes and other 
nutrition solutions. 

1. INFANT FORMULATIONS 
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A. Isomil® Soy Formula with Iron. 

Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's milk. A feeding for patients with disorders for which 
lactose should be avoided: lactase deficiency, lactose intolerance and 
5 galactosemia. 

Features: 

• Soy protein isolate to avoid symptoms of cow's-milk-protein allergy 
or sensitivity 

• Lactose-free formulation to avoid lactose-associated diarrhea 

1 0 • Low osmolaity (240 mOsm/kg water) to reduce risk of osmotic 

diarrhea. 

■t 

• Dual carbohydrates (com syrup and sucrose) designed to enhance 
carbohydrate absorption and reduce the risk of exceeding the absorptive 
capacity of the damaged gut. 

15 • 1 .8 mg of Iron (as ferrous sulfate) per 100 Calories to help prevent 

iron deficiency. 

• Recommended levels of vitamins and minerals. 

• Vegetable oils to provide recommended levels of essential fatty 
acids. 

20 • Milk-white color, milk-like consistency and pleasant aroma. 

Ingredients: (Pareve, ®) 85% water, 4.9% com syrup, 2.6% sugar 
(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1 % calcium phosphate tribasic, potassium citrate, potassium 
phosphate monobasic, potassium chloride, mono- and disglycerides, soy 
25 lecithin, carrageenan, ascorbic acid, L-methionine, magnesium chloride, 

potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 
sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-camitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 
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acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin. 

B. Isomil® DF Soy Formula For Diarrhea. 

Usage: As a short-term feeding for the dietary management of diarrhea 
S in infants and toddlers. 

Features: 

• First infant formula to contain added dietary fiber from soy fiber 
specifically for diarrhea management. 

• Clinically shown to reduce the duration of loose, watery stools 
10 during mild to severe diarrhea in infants. 

• Nutritionally complete to meet the nutritional needs of the infant.. 

• Soy protein isolate with added L-methionine meets or exceeds an 
infant's requirement for ail essential amino acids. 

• Lactose-free formulation to avoid lactose-associated diarrhea. 

1 5 • Low osmolality (240 mOsm/kg water) to reduce the risk of osmotic 

diarrhea. 

• Dual carbohydrates (com syrup and sucrose) designed to enhance 
carbohydrate absorption and reduce the risk of exceeding the absorptive 
capacity of the damaged gut. 

20 • Meets or exceeds the vitamin and mineral levels recommended by 

the Committee on Nutrition of the American Academy of Pediatrics and 
required by the Infant Formula Act. 

• 1 ,8 mg of iron (as ferrous sulfate) per 1 00 Calories to help prevent 
iron deficiency. 

25 • Vegetable oils to provide reconunended levels of essential fatty 

acids. 

Ingredients: (Pareve, ®) 86% water, 4.8% com syrup, 2.5% sugar 
(sucrose), 2.1% soy oil, 2.0% soy protein isolate, 1.4% coconut oil, 0.77% soy 
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fiber, 0.12% calcium citrate, 0.1 1 % calcium phosphate tribasic, 0.10% 
potassium citrate, potassium chloride, potassium phosphate monobasic, mono- 
and disglycerides, soy lecithin, carrageenan, magnesium chloride, ascorbic acid, 
L-methionine, potassium phosphate dibasic, sodium chloride, choline chloride, 
5 taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L- 

camitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin A 
palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin. 

1 0 C. Isomil® SF Sucrose-Free Soy Formula With Iron. 

Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow*s-milk protein or an intolerance to sucrose. A feeding for 
patients with disorders for which lactose and sucrose should be avoided. 

Features: 

1 5 • Soy protein isolate to avoid symptoms of cow's-milk-protein allergy 

or sensitivity. 

• Lactose-free formulation to avoid lactose-associated diarrhea 
(carbohydrate source is Polycose® Glucose Polymers). 

• Sucrose free for the patient who caimot tolerate sucrose. 

20 • Low osmolality ( 1 80 mOsm/kg water) to reduce risk of osmotic 

diarrhea. 

• 1.8 mg of iron (as ferrous sulfate) per 1 00 Calories to help prevent 
iron deficiency. 

• Recommended levels of vitamins and minerals. 

25 • Vegetable oils to provide recommended levels of essential fatty 

acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

Ingredients: (Pareve, ®) 75% water, 1 1 .8% hydrolized cornstarch, 4.1% 
soy oil, 4.1% soy protein isolate, 2.8% coconut oil, 1 .0% modified cornstarch, 
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0.38% calcium phosphate tribasic, 0.17% potassium citrate, 0.13% potassium 
chloride, mono- and disglycerides, soy lecithin, magnesium chloride, abscorbic 
acid, L-methionine, calcium carbonate, sodium chloride, choline chloride, 
carrageenan, taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc 
sulfate, L-camitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin 
A palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin. 

D, Isomil® 20 Soy Formula With Iron Ready To Feed, 
20 Cal/fl oz. 

Usage: When a soy feeding is desired. 

Ingredients: (Pareve, ®) 85% water, 4.9% com syrup, 2.6% sugar 
(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1% calcium phosphate tribasic, potassium citrate, potassium 
phosphate monobasic, potassium chloride, mono- and disglycerides, soy 
lecithin, carrageenan, abscorbic acid, L-methionine, magnesium chloride, 
potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 
sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-camitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 
acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin. 

E. Simiiac® Infant Formula 

Usage: When an infant formula is needed: if the decision is made to 
discontinue breastfeeding before age 1 year, if a supplement to breastfeeding is 
needed or as a routine feeding if breastfeeding is not adopted. 

Features: 

• Protein of appropriate quality and quantity for good growth; heat- 
denatured, which reduces the risk of milk-associated enteric blood loss. 
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• Fat from a blend of vegetable oils (doubly homogenized), providing 
essential linoleic acid that is easily absorbed. 

• Carbohydrate as lactose in proportion similar to that of human milk. 

• Low renal solute load to minimize stress on developing organs. 

5 • Powder, Concentrated Liquid and Ready To Feed forms. 

Ingredients: (®-D) Water, nonfat milk, lactose, soy oil, coconut oil, 
mono- and diglycerides, soy lecithin, abscorbic acid, carrageenan, choline 
chloride, taurine, m-inositol, alpha-tocopheryl acetate, zinc sulfate, niacinamid, 
ferrous sulfate, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
1 0 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, phylloquinone, biotin, sodium selenite, vitamin D3 and 
cyanocobalamin 

F. Similac® NcoCare Premature Infant Formula With Iron 

Usage: For premature infants' special nutritional needs after hospital 
1 5 discharge. Similac NeoCare is a nutritionally complete formula developed to 
provide premature infants with extra calories, protein, vitamins and minerals 
needed to promote catch-up growth and support development. 

Features: 

• Reduces the need for caloric and vitamin supplementation. More 
20 calories (22 Cal/fl oz) then standard term formulas (20 Cal/fl oz). 

• Highly absorbed fat blend, with medium-chain triglycerides (MCT 
oil) to help meet the special digestive needs of premature infants. 

• Higher levels of protein, vitamins and minerals per 1 00 Calories to 
extend the nutritional support initiated in-hospital. 

25 • More calcium and phosphorus for improved bone mineralization. 

Ingredients: ®-D Com syrup solids, nonfat milk, lactose, whey protein 
concentrate, soy oil, high-oleic safflower oil, fractionated coconut oil (medium- 
chain triglycerides), coconut oil, potassium citrate, calcium phosphate tribasic, 
calcium carbonate, ascorbic acid, magnesium chloride, potassium chloride, 
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sodium chloride, taurine, ferrous sulfate, m-inositol, choline chloride, ascorbyl 
palmitate, L-camitine, alpha-tocopheryl acetate, zinc sulfate, niacinamide, 
mixed tocopherols, sodium citrate, calcium pantothenate, cupric sulfate, 
thiamine chloride hydrochloride, vitamin A palmitate, beta carotene, riboflavin, 
5 pyridoxine hydrochloride, folic acid, manganese sulfate, phylloquinone, biotin, 
sodium selenite, vitamin D3 and cyanocobalamin. 

G. Similac Natural Care Low-Iron Human Milk Fortifier Ready 
To Use, 24 Cal/fi oz. 

Usage: Designed to be mixed with human milk or to be fed alternatively 
1 0 with human milk to low-birth-weight infants. 

Ingredients: ®-D Water, nonfat milk, hydrolyzed cornstarch, lactose, 
fractionated coconut oil (medium-chain triglycerides), whey protein , . 

concentrate, soil oil, coconut oil, calcium phosphate tribasic, potassium citrate, 
magnesium chloride, sodium citrate, ascorbic acid, calcium carbonate, mono- 

1 5 and diglycerides, soy lecithin, carrageenan, choline chloride, m-inositol, taurine, 

niacinamide, L-camitine, alpha tocopheryl acetate, zinc sulfate, potassiimi 
chloride, calcium pantothenate, ferrous sulfate, cupric sulfate, riboflavin, 
vitamin A palmitate, thiamine chloride hydrochloride, pyridoxine 
hydrochloride, biotin, folic acid, manganese sulfate, phylloquinone, vitamin D3, 

20 sodium selenite and cyanocobalamin. 

Various PUFAs of this invention can be substituted and/or added to the 
infant formulae described above and to other infant formulae known to those in 
the art.. 

11. NUTRITIONAL FORMULATIONS 
25 A. ENSURE® 

Usage: ENSURE is a low-residue liquid food designed primarily as an 
oral nutritional supplement to be used with or between meals or, in appropriate 
amounts, as a meal replacement, ENSURE is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. Although it 
30 is primarily an oral supplement, it can be fed by tube. 
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Patient Conditions: 

• For patients on modified diets 

• For elderly patients at nutrition risk 

• For patients with involuntary weight loss 

5 • For patients recovering from illness or surgery 

• For patients who need a low-residue diet 
Ingredients: 

®-D Water, Sugar (Sucrose), Maltodextrin (Com), Calcium and Sodium 
Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, Canola Oil, 

10 Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, Magnesium 

Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium Chloride, 
Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc Sulfate, 
Ferrous Sulfate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 

1 5 Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 

Riboflavin, Folic Acid, Sodium Molybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate. 



B. ENSURE® BARS 

20 Usage: ENSURE BARS are complete, balanced nutrition for 

supplemental use between or with meals. They provide a delicious, nutrient- 
rich alternative to other snacks. ENSURE BARS contain <1 g lactose/bar, and 
Chocolate Fudge Brownie flavor is gluten-free. (Honey Graham Crunch flavor 
contains gluten.) 

25 Patient Conditions: 

• For patients who need extra calories, protein, vitamins and minerals 

• Especially useful for people who do not take in enough calories and 
nutrients 
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• For people who have the ability to chew and swallow 

• Not to be used by anyone with a peanut allergy or any type of allergy to 
nuts. 

Ingredients: 

5 Honey Graham Crunch - High-Fructose Com Syrup, Soy Protein 

Isolate, Brown Sugar, Honey, Maltodextrin (Com), Crisp Rice (Milled Rice, 
Sugar [Sucrose], Salt [Sodium Chloride] and Malt), Oat Bran, Partially 
Hydrogenated Cottonseed and Soy Oils, Soy Polysaccharide, Glycerine, Whey 
Protein Concentrate, Polydextrose, Fmctose, Calcium Caseinate, Cocoa 

1 0 Powder, Artificial Flafors, Canola Oil, High-Oleic Safflower Oil, Nonfat Dry 

Milk, Whey Powder, Soy Lecithin and Com Oil. Manufactured in a facility that 
processes nuts. . . 

Vitamins and Minerals: 

Calcium Phosphate Tribasic, Potassium Phosphate Dibasic, Magnesium 
1 5 Oxide, Salt (Sodium Chloride), Potassium Chloride, Ascorbic Acid, Ferric 

Orthophosphate, Alpha-Tocopheryl Acetate, Niacinamide, Zmc Oxide, Calcium 
Pantothenate, Copper Gluconate, Manganese Sulfate, Riboflavin, Beta- 
Carotene, Pyridoxine Hydrochloride, Thiamine Mononitrate, Folic Acid, Biotin, 
Chromium Chloride, Potassium Iodide, Sodium Selenate, Sodium Molybdate, 
20 Phylloquinone, Vitamin D3 and Cyanocobalamin. 

Protein: 

Honey Graham Crunch - The protein source is a blend of soy protein isolate 
and milk proteins. 

Soy protein isolate 74% 
25 Milk proteins 26% 

Fat: 

Honey Graham Crunch - The fat source is a blend of partially 
hydrogenated cottonseed and soybean, canola, high oleic safiQower, and com 
oils, and soy lecithin. 

-73- 



wo 98/46765 PCT/US98/07422 



Partially hydrogenated cottonseed and soybean oil 76% 



Canola oil 8% 

High-oleic safflower oil 8% 

Com oil 4% 

5 Soy lecithin 4% 

Carbohydrate: 



Honey Graham Crunch - The carbohydrate source is a combination of 
high-fructose com symp, brown sugar, maltodextrin, honey, crisp rice, 
glycerine, soy polysaccharide, and oat bran. 



1 0 High-fmctose com symp 24% 

Brown sugar 21% 

Maltodextrin 12% 

Honey 11% 

Crisp rice 9% 

1 5 Glycerine 9% 

Soy polysaccharide 7% 

Oat bran 7%\ 



C. ENSURE® HIGH PROTEIN 

20 Usage: ENSURE HIGH PROTEIN is a concentrated, high-protein 

liquid food designed for people who require additional calories, protein, 
vitamins, and minerals in their diets. It can be used as an oral nutritional 
supplement with or between meals or, in appropriate amounts, as a meal 
replacement. ENSURE HIGH PROTEIN is lactose- and gluten-free, and is 

25 suitable for use by people recovering from general surgery or hip fractures and 

by patients at risk for pressure ulcers. 

Patient Conditions 

• For patients who require additional calories, protein, vitamins, and minerals, 
such as patients recovering from general surgery or hip fractures, patients at risk 
30 for pressure ulcers, and patients on low-cholesterol diets 
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Features- 

• Low in saturated fat 

• Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

5 • Excellent source of protein, calcium, and other essential vitamins and 

minerals 

• For low-cholesterol diets 

• Lactose-free, easily digested 
Ingredients: 

10 Vanilla Supreme: -®-D Water, Sugar (Sucrose), Maltodextrin (Com), Calcium 
and Sodium Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, 
Canola Oil, Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, 
Magnesium Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium 
Chloride, Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc 

15 Sulfate, Ferrous Suffate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 
Calciimi Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
Riboflavin, Folio Acid, Sodium Motybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D.3 and 

20 Cyanocobalamin. 

Protein: 

The protein source is a blend of two high-biologic-value proteins: casein and 
soy. 

Sodium and calcium caseinates 85% 
25 Soy protein isolate 1 5% 

Fat: 

The fat source is a blend of three oils: high-oleic safflower, canola, and soy. 
High-oieic safflower oil 40% 
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Canola oil 30% 

Soy oil 30% 

The level of fat in ENSURE HIGH PROTEIN meets American Heart 
Association (AHA) guidelines. The 6 grams of fat in ENSURE HIGH 
5 PROTEIN represent 24% of the total calories, with 2.6% of the fat being from 

saturated fatty acids and 7.9% from polyunsaturated fatty acids. These values 
are within the AHA guidelines of < 30% of total calories from fat, < 1 0% of the 
calories from saturated fatty acids, and < 1 0% of total calories from 
polyunsaturated fatty acids. 

10 Carbohydrate: 

ENSURE HIGH PROTEIN contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla supreme, chocolate 
royal, wild berry, and banana), plus VARI-FLAVORSO® Flavor Pacs in pecan, 
cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 
1 5 patient compliance. 

Vanilla and other nonchocolate flavors 



Sucrose 60% 

Maltodextrin 40% 
Chocolate 

20 Sucrose 70% 

Mahodextrin 30% 



D. ENSURE ® LIGHT 

Usage: ENSURE LIGHT is a low-fat liquid food designed for use as an 
25 oral nutritional supplement with or between meals. ENSURE LIGHT is 

lactose- and gluten-free, and is suitable for use in modified diets, including low- 
cholesterol diets. 

Patient Conditions: 
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• For normal-weight or overweight patients who need extra nutrition in a 
supplement that contains 50% less fat and 20% fewer calories than ENSURE 

• For healthy adults who don't eat right and need extra nutrition 
Features: 

5 • Low in fat and saturated fat 

• Contains 3 g of total fat per serving and < 5 mg cholesterol 

• Rich, creamy taste 

• Excellent source of calcium and other essential vitamins and minerals 

• For low-cholesterol diets 

1 0 • Lactose-free, easily digested 
Ingredients: 

French Vanilla: ®-D Water, Maltodextrin (Com), Sugar (Sucrose), Calcium 
Caseinate, High-Oleic Safflower Oil, Canola Oil, Magnesium Chloride, Sodium 
Citrate, Potassixmi Citrate, Potassium Phosphate Dibasic, Magnesium Phosphate 

1 5 Dibasic, Natural and Artificial Flavor, Calcium Phosphate Tribasic, Cellulose 

Gel, Choline Chloride, Soy Lecithin, Carrageenan, Salt (Sodium Chloride), 
Ascorbic Acid, Cellulose Gum, Ferrous Sulfate, Alpha-Tocopheryl Acetate, 
Zinc Sulfate, Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric 
Sulfate, Thiamine Chloride Hydrochloride, Vitamin A Palmitate, Pyridoxine 

20 Hydrochloride, Riboflavin, Chromium Chloride, Folic Acid, Sodium 

Molybdate, Biotin, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin 
D3 and Cyanocobalamin. 

Protein: 

The protein source is calcium caseinate. 
25 Calcium caseinate 1 00% 

Fat 

The fat source is a blend of two oils: high-oleic safflower and canola. 
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High-oleic safflower oil 70% 

Canola oil 30% 

The level of fat in ENSURE LIGHT meets American Heart Association 
(AHA) guidelines. The 3 grams of fat in ENSURE LIGHT represent 13.5% of 
5 the total calories, with 1 .4% of the fat being from saturated fatty acids and 2.6% 

from polyunsaturated fatty acids. These values are within the AHA guidelines 
of < 30% of total calories from fat, < 1 0% of the calories from saturated fatty 
acids, and < 1 0% of total calories from polyunsaturated fatty acids. 

Carbohydrate 

1 0 ENSURE LIGHT contains a combination of maltodextrin and sucrose. 

The chocolate flavor contains com syrup as well. The mild sweetness and 
flavor variety (French vanilla, chocolate supreme, strawberry swirl), plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
orange, help to prevent flavor fatigue and aid in patient compliance. 

1 5 Vanilla and other nonchocolate flavors 



Sucrose 5 1 % 

Maltodextrin 49% 
Chocolate 

Sucrose 47.0% 

20 Com Syrup 26.5% 

Maltodextrin 26.5% 



Vitamins and Minerals 

An 8-fl-oz serving of ENSURE LIGHT provides at least 25% of the 
RDIs for 24 key vitamins and minerals. 

25 Caffeine 

Chocolate flavor contains 2.1 mg caffeine/8 fl oz. 
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E. ENSURE PLUS® 

Usage: ENSURE PLUS is a high-calorie, low-residue liquid food for 
use when extra calories and nutrients, but a normal concentration of protein, are 
needed. It is designed primarily as an oral nutritional supplement to be used 
5 with or between meals or, in appropriate amounts, as a meal replacement. 

ENSURE PLUS is lactose- and gluten-free. Although it is primarily an oral 
nutritional supplement, it can be fed by tube. 

Patient Conditions: 

• For patients who require extra calories and nutrients, but a normal 
10 concentration of protein, in a limited volume 

• For patients who need to gain or maintain healthy weight 
Features 

• Rich, creamy taste 

• Good source of essential vitamins and minerals 
IS Ingredients 

Vanilla: ®-D Water, Com Syrup, Maltodextrin (Com), Com Oil, Sodium and 
Calcium Caseinates, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, 
Potassium Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and 
Artificial Flavor, Sodium Citrate, Potassium Chloride, Choline Chloride, 

20 Ascorbic Acid, Carrageenan, Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl 
Acetate, Niacinamide, Calcium Pantothenate, Manganese Sulfate, Cupric 
Sulfate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
Riboflavin, Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, 
Sodium Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 

25 Cyanocobalamin and Vitamin D3. 

Protein 

The protein source is a blend of two high-biologic- value proteins: casein 
and soy. 

Sodium and calcium caseinates 84% 
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Soy protein isolate 1 6% 

Fat 

The fat source is com oil. 

Com oil 100% 
5 Carbohydrate 

ENSURE PLUS contains a combination of maltodextrin and sucrose. 
The mild sweetness and flavor variety (vanilla, chocolate, strawberry, coffee, 
buffer pecan, and eggnog), plus VARI-FLAVORS® Flavor Pacs in pecan, 
cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 
1 0 patient compliance. 

Vanilla, strawberry, butter pecan, and coffee flavors 

Com Symp 39% 

Maltodextrin 38% 

Sucrose 23% 

1 5 Chocolate and eggnog flavors 

Com Symp 36% 

Maltodextrin 34% 

Sucrose 30% 

Vitamins and Minerals 

20 An 8-fl-oz serving of ENSURE PLUS provides at least 1 5% of the RDIs 

for 25 key Vitamins and minerals. 

Caffeine 

Chocolate flavor contains 3.1 mg Caffeine/8 fl oz. Coffee flavor 
contains a trace amount of caffeine. 



25 



F. ENSURE PLUS® HN 
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Usage: ENSURE PLUS HN is a nutritionally complete high-calorie, 
high-nitrogen liquid food designed for people with higher calorie and protein 
needs or limited volume tolerance. It may be used for oral supplementation or 
for total nutritional support by tube. ENSURE PLUS HN is lactose- and gluten- 
S free. 

Patient Conditions: 

• For patients with increased calorie and protein needs, such as following 
surgery or injury 

• For patients with limited volume tolerance and early satiety 
1 0 Features 

• For supplemental or total nutrition 

• For oral or tube feeding 

• l.SCaVmL 

• High nitrogen 

15 • Calorically dense 

Ingredients 

Vanilla: ®-D Water, Maltodextrin (Com), Sodium and Calciimi Caseinates, 
Com Oil, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, Potassium 
Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and Artificial 

20 Flavor, Sodium Citrate, Choline Chloride, Ascorbic Acid, Taurine, L-Camitine, 
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Carrageenan, Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, 
Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, 
Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, Sodium 

25 Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 

Cyanocobalamin and Vitamin D3. 



G. ENSURE® POWDER 



-81- 



98/46765 



PCT/US98/07422 



Usage: ENSURE POWDER (reconstituted with water) is a low-residue 
liquid food designed primarily as an oral nutritional supplement to be used with 
or between meals. ENSURE POWDER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions: 

• For patients on modified diets 

• For elderly patients at nutrition risk 

• For patients recovering from illness/surgery 

• For patients who need a low-residue diet 
Features 

• Convenient, easy to mix 

• Low in saturated fat 

• Contains 9 g of total fat and < 5 mg of cholesterol per serving 

• High in vitamins and minerals 

• For low-cholesterol diets 

• Lactose-free, easily digested 

Ingredients: ®-D Com Syrup, Maltodextrin (Com), Sugar (Sucrose), Com Oil, 
Sodium and Calcium Caseinates, Soy Protein Isolate, Artificial Flavor, 
Potassium Citrate, Magnesium Chloride, Sodium Citrate, Calcium Phosphate 
Tribasic, Potassium Chloride, Soy Lecithin, Ascorbic Acid, Choline Chloride, 
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Thiamine Chloride Hydrochloride, 
Cupric Sulfate, Pyridoxine Hydrochloride, Riboflavin, Vitamin A Palmitate, 
Folic Acid, Biotin, Sodium Molybdate, Chromium Chloride, Potassium Iodide, 
Sodium Selenate, Phylloquinone, Vitamin D3 and Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins: casein 
and soy. 
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Sodium and calcium caseinates 



84% 



Soy protein isolate 



16% 



Fat 



The fat source is com oil. 



Com oil 



100% 



10 



15 



20 



Carbohydrate 

ENSURE POWDER contains a combination of com symp, 
maltodextrin, and sucrose. The mild sweetness of ENSURE POWDER, plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
orange, helps to prevent flavor fatigue and aid in patient compliance. 

Vanilla 

Com Syrup 35% 

Maltodextrin 35% 

Sucrose 30% 

H. ENSURE® PUDDING 

Usage: ENSURE PUDDING is a nutrient-dense supplement providing 
balanced nutrition in a nonliquid form to be used with or between meals. It is 
appropriate for consistency-modified diets (e.g., soft, pureed, or full liquid) or 
for people with swallowing impairments. ENSURE PUDDING is gluten-free. 

Patient Conditions: 

• For patients on consistency-modified diets (e.g., soft, pureed, or fiill liquid) 

• For patients with swallowing impairments 

• Features 

• Rich and creamy, good taste 

• Good source of essential vitamins and minerals Convenient-needs no 
refrigeration 
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• Gluten-free 

Nutrient Profile per 5 oz: Calories 250, Protein 10.9%, Total Fat 34.9%, 
Carbohydrate 54.2% 

Ingredients: 

5 Vanilla: ®-D Nonfat Milk, Water, Sugar (Sucrose), Partially Hydrogenated 

Soybean Oil, Modified Food Starch, Magnesium Sulfate. Sodium Stearoyl 
Lactylale, Sodium Phosphate Dibasic, Artificial Flavor, Ascorbic Acid, Zinc 
Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Choline Chloride, 
Niacinamide, Manganese Sulfate, Calcium Pantothenate, FD&C Yellow #5, 

10 Potassium Citrate, Cupric Sulfate, Vitamin A Palmitate, Thiamine Chloride 

Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, FD&C Yellow #6, Folic 
Acid, Biotin, Phylloquinone, Vitamin D3 and Cyanocobalamin. 

Protein 

The protein source is nonfat milk. 
15 Nonfat milk 100% 

Fat 

The fat source is hydrogenated soybean oil. 
Hydrogenated soybean oil 1 00% 

Carbohydrate 

20 ENSURE PUDDING contains a combination of sucrose and modified 

food starch. The mild sweetness and flavor variety (vanilla, chocolate, 
butterscotch, and tapioca) help prevent flavor fatigue. The product contains 9.2 
grams of lactose per serving. 

Vanilla and other nonchocolate flavors 



25 Sucrose 56% 

Lactose 27% 

Modified food starch 1 7% 
Chocolate 
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Sucrose 



58% 



Lactose 



26% 



Modified food starch 



16% 



10 



15 



20 



I. ENSURE® WITH FIBER 

Usage: ENSURE WITH FIBER is a fiber-containing, nutritionally 
complete liquid food designed for people who can benefit from increased 
dietary fiber and nutrients. ENSURE WITH FIBER is suitable for people who 
do not require a low-residue diet. It can be fed orally or by tube, and can be 
used as a nutritional supplement to a regular diet or, in appropriate amounts, as 
a meal replacement. ENSURE WITH FIBER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions 

• For patients who can benefit from increased dietary fiber and nutrients 
Features 

• New advanced formula-low in saturated fat, higher in vitamins and minerals 

• Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

• Good source of fiber 

• Excellent sotirce of essential vitamins and minerals 

• For low-cholesterol diets 

• Lactose- and gluten-free 
Ingredients 

Vanilla: ®-D Water, Maltodextrin (Com), Sugar (Sucrose), Sodium and 
Calcium Caseinates, Oat Fiber, High-Oleic Safflower Oil, Canola Oil, Soy 
Protein Isolate, Com Oil, Soy Fiber, Calcium Phosphate Tribasic, Magnesium 
Chloride, Potassium Citrate, Cellulose Gel, Soy Lecithin, Potassium Phosphate 
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Dibasic, Sodium Citrate, Natural and Artificial Flavors, Choline Chloride, 
Magnesium Phosphate, Ascorbic Acid, Cellulose Gum, Potassium Chloride, 
Carrageenan, Ferrous Sulfate, Alpha-Tocopheiyl Acetate, Zinc Sulfate, 
Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric Sulfate, 
5 Vitamin A Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine 

Hydrochloride, Riboflavin, Folic Acid, Chromium Chloride, Biotin, Sodium 
Molybdate, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D3 and 
Cyanocobalamin. 

Protein 

10 The protein source is a blend of two high-biologic- value proteins- casein 

and soy. 

Sodium and calcium caseinates 80% 



Soy protein isolate 20% 



Fat 



15 The fat source is a blend of three oils: high-oleic safflower, canola, and 

com. 

High-oleic safflower oil 40% 
Canola oil 40% 
Com oil 20% 

20 The level of fat in ENSURE WITH FIBER meets American Heart 

Association (AHA) guidelines. The 6 grams of fat in ENSURE WITH FIBER 
represent 22% of the total calories, with 2.01 % of the fat being from saturated 
fatty acids and 6.7% from polyunsaturated fatty acids. These values are within 
the AHA guidelines of < 30% of total calories from fat, < 1 0% of the calories 

25 from saturated fatty acids, and < 1 0% of total calories from polyunsaturated 

fatty acids. 

Carbohydrate 

ENSURE WITH FIBER contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla, chocolate, and butter 
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pecan), plus VARI-FLAVORS® Flavor Pacs in pecan, cheny, strawberry, 
lemon, and orange, help to prevent flavor fatigue and aid in patient compliance. 

Vanilla and other nonchocolate flavors 

Maltodextrin 66% 



Sucrose 25% 



0 



Oat Fiber 7% 
Soy Fiber 2% 
Chocolate 

Maltodextrin 55% 
10 Sucrose 36% 

Oat Fiber 7% 
Soy Fiber 2% 

Fiber 

The fiber blend used in ENSURE WITH FIBER consists of oat fiber and 
1 5 soy polysaccharide. This blend results in approximately 4 grams of total dietary 

fiber per 8-fl-oz can. The ratio of insoluble to soluble fiber is 95:5. 

The various nutritional supplements described above and known to 
others of skill in the art can be substituted and/or supplemented with the PUFAs 
of this invention. 

20 J, Oxepa™ Nutritional Product 

Oxepa is low-carbohydrate, calorically dense enteral nutritional product 
designed for the dietary management of patients with or at risk for ARDS. It 
has a unique combination of ingredients, including a patented oil blend 
containing eicosapentaenoic acid (EPA from fish oil), y-linolenic acid (GLA 
25 fi-om borage oil), and elevated antioxidant levels. 

Caloric Distribution: 
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• Caloric density is high at 1.5 Cal/mL (355 Cal/8 fl oz), to minimize the 
volume required to meet energy needs. 



The distribution of Calories in Oxepa is shown in Table 7. 



Table 7. Caloric Distribution of Oxepa 




per 8 fl oz. 


per liter 


VoofCal 


Calories 


355 


1,500 




Fat (g) 


22.2 


93.7 


55.2 


Carbohydrate (g) 


25 


105.5 


28.1 


Protein (g) 


14.8 


62.5 


16,7 


Water (g) 


186 


785 





5 Fat: 

• Oxepa contains 22.2 g of fat per 8-fl oz serving (93.7 g/L). 

• The fat source is a oil blend of 3 1.8% canola oil, 25% medium-chain 
triglycerides (MCTs), 20% borage oil, 20% fish oil, and 3.2 % soy lecithin. The 
typical fatty acid profile of Oxepa is shown in Table 8. 

1 0 • Oxepa provides a balanced amount of polyunsaturated, monounsaturated, 
and saturated fatty acids, as shown in Table 10. 

• Medium-chain trigylcerides (MCTs) - 25% of the fat blend - aid gastric 
emptying because they are absorbed by the intestinal tract vdthout 
emulsification by bile acids. 

1 5 The various fatty acid components of Oxepa""^^ nutritional product can 

be substituted and/or supplemented with the PUFAs of this invention. 



Table 8. Typical Fatty Acid Profile 




% Total Fatty 
Acids 


g/8 fl oz* 


g/L* 


Caproic (6:0) 


0.2 


0.04 


0.18 


Caprylic (8:0) 


14.69 


3.1 


13.07 


Capric(10:0) 


11.06 


2.33 


9.87 


Palmitic (16.0) 


5.59 


1.18 


4,98 


Palmitoleic(16:ln-7) 


1.82 


0.38 


1.62 


Stearic (18:0) 


1.84 


0.39 


1.64 



-88- 



wo 98/46765 PCT/US98/07422 



Oleic{18:ln-9) 


24.44 


5.16 


21.75 


Linoleic (18:2n-6) 


16.28 


3.44 


14.49 


a-Linolenic (18:3n-3) 


3.47 


0.73 


3.09 


Y-Linoienic (18:3n-6) 


4.82 


1.02 


4.29 


Eicosapentaenoic (20:5n- 
3) 


5.11 


1.08 


4.55 


n-3'Docosapentaenoic 
(22:5n-3) 


0.55 


0.12 


0.49 


Docosahexaenoic (22:6n- 
3) 


2.27 


0.48 


2.02 


Others 


7.55 


1.52 


6.72 


* Fatty acids equal approximately 95% of total fat. 


Table 9. Fat Profile of Oxepa. 


% of total calories from fat 


55.2 


Polyunsaturated fatty acids 


3 1 .44 g/L 


Monounsaturated fatty acids 


25.53 g/L 


Saturated fatty acids 


32.38 g/L 


n-6 to n-3 ratio 


1.75:1 


Cholesterol 


9.49 mg/8 fl 02 
40.1 mg/L 


Carbohydrate: 



• The carbohydrate content is 25.0 g per 8-fl-02 serving (1 05.5 g/L). 

5 • The carbohydrate sources are 45% maitodextrin (a complex carbohydrate) 

and 55% sucrose (a simple sugar), both of which are readily digested and 
absorbed. 

• The high-fat and low-carbohydrate content of Oxepa is designed to 
minimize carbon dioxide (CO2) production. High CO2 levels can complicate 

10 weaning in ventilator-dependent patients. The low level of carbohydrate also 
may be useful for those patients who have developed stress-induced 
hyperglycemia. 

• Oxepa is lactose-free. 

Dietary carbohydrate, the amino acids from protein, and the glycerol 
1 5 moiety of fats can be converted to glucose within the body. Throughout this 

process, the carbohydrate requirements of glucose-dependent tissues (such as 
the central nervous system and red blood cells) are met. However, a diet free of 
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carbohydrates can lead to ketosis, excessive catabolism of tissue protein, and 
loss of fluid and electrolytes. These effects can be prevented by daily ingestion 
of 50 to 100 g of digestible carbohydrate, if caloric intake is adequate. The 
carbohydrate level in Oxepa is also sufficient to minimize gluconeogenesis, if 
5 energy needs are being met. 

Protein: 

• Oxepa contains 14.8 g of protein per 8-fl-oz serving (62.5 g/L). 

• The total calorie/nitrogen ratio (150:1) meets the need of stressed patients. 

• Oxepa provides enough protein to promote anabolism and the maintenance 
10 of lean body mass without precipitating respiratory problems. High protein 

intakes are a concern in patients with respiratory insufficiency. Although 
protein has little effect on CO2 production, a high protein diet will increase 
ventilatory drive. 

• The protein sources of Oxepa are 86.8% sodium caseinate and 13.2% 
1 5 calcium caseinate. 

All publications and patent applications mentioned in this specification 
are indicative of the level of skill of those skilled in the art to which this 
invention pertains. All publications and patent applications are herein 
20 incorporated by reference to the same extent as if each individual publication or 
patent application was specifically and individually indicated as incorporated by 
reference. 

The invention now being fully described, it will be apparent to one of 
ordinary skill in the art that many changes and modifications can be made 
25 thereto without departing from the spirit or scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

U) APPLICANT: KNUTZON, DEBORAH 

MURKERJI, PRADIP 
HUANG, YUNG-SHENG 
THURMOND, JENNIFER 
10 CHAUDHARY, SUNITA 

LEONARD, AMANDA 



15 



55 



60 



(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR 

SYJITHESIS OF LONG CHAIN POLY -UNSATURATED FATTY ACIDS 

(iii) NUMBER OF SEQUENCES: 34 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LIMBACH & LIMBACH LLP 
20 (B) STREET: 2001 FERRY BUILDING 

(C) CITY: SAN FRANCISCO 

(D) STATE: CALIFORNIA 

(E) COUNTRY: USA 
(P) ZIP: 94111 

25 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible ■ 

(C) OPERATING SYSTEM: PC- DOS /MS -DOS 

30 (D) SOFTWARE: PatentIn Release #1.0, Version ttl.30 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
35 (C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: MICHAEL R. WARD 

(B) REGISTRATION NUMBER: 36,651 

40 (C) REFERENCE/DOCKET NUMBER: CGAB-110 

iix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: U15) 433-4150 

(B) TELEFAX: (415) 433-8716 
45 (C) TELEX: N/A 



(2) INFORMATION FOR SEQ ID N0:1: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GCTTCCTCCA GTTCATCCTC CATTTCGCCA CCTGCATTCT TTACGACCGT TAAGCAAGAT 60 
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10 



20 



30 



GGGAACGGAC CAAGGAAAAA CCTTCACCTG GGAAGAGCTG GCGGCCCATA ACACCAAGGA 120 

CGACCTACTC TTGGCCATCC GCGGCAGGGT GTACGATGTC ACAAAGTTCT TGAGCCGCCA 180 

TCCTGGTGGA GTGGACACTC TCCTGCTCGG AGCTGGCCGA GATGTTACTC CGGTCTTTGA 240 

GATGTATCAC GCGTTTGGGG CTGCAGATGC CATTATGAAG AAGTACTATG TCGGTACACT 300 

GGTCTCGAAT GAGCTGCCCA TCTTCCCGGA GCCAACGGTG TTCCACAAAA CCATCAAGAC 360 

GAGAGTCGAG GGCTACTTTA CGGATCGGAA CATTGATCCC AAGAATAGAC CAGAGATCTG 420 

GGGACGATAC GCTCTTATCT TTGGATCCTT GATCGCTTCC TACTACGCGC AGCTCTTTGT 4 80 

15 GCCTTTCGTT GTCGAACGCA CATGGCTTCA GGTGGTGTTT GCAATCATCA TGGGATTTGC 540 

GTGCGCACAA GTCGGACTCA ACCCTCTTCA T6ATGCGTCT CACTTTTCAG TGACCCACAA 600 

CCCCACTGTC TGGAAGATTC TGGGAGCCAC GCACGACTTT TTCAACGGAG CATCGTACCT 660 

GGTGTGGATG TACCAACATA TGCTCGGCCA TCACCCCTAC ACCAACATTG CTGGAGCAGA 720 

TCCCGACX3TG TCGACGTCTG AGCCCGATGT TCGTCGTATC AAGCCCAACC AAAAGTGGTT 780 

25 TGTCAACCAC ATCAACCAGC ACATGTTTGT TCCTTTCCTG TACGGACTGC TGGCGTTCAA i340 

GGTGCGCATT CAGGACATCA ACATTTTGTA CTTTGTCAAG ACCAATGACG CTATTCGTGT 900 

CAATCCCATC TCGACATG6C ACACTGTGAT GTTCTGGGGC GGCAAGGCTT TCTTTGTCTG 960 

GTATCGCCTG ATTGTTCCCC TGCAGTATCT GCCCCTGGGC AAGGTGCTGC TCTTGTTCAC 1020 

GGTCGCG6AC ATGGTGTCGT CTTACTGGCT GGCGCTGACC TTCCAGGCGA ACCACGTTGT 1080 

35 " TGAGGAAGTT CAGTGGCCGT TGCCTGACGA GAACGGGATC ATCCAAAAGG ACTGGGCAGC 1140 

TATGCAGGTC GAGACTACGC AGGATTACGC ACACGATTCG CACCTCTGGA CCAGCATCAC 1200 

TGGCAGCTTG AACTACCAGG CTGTGCACCA TCTGTTCCCC AACGTGTCGC AGCACCATTA 1260 

TCCCGATATT CTGGCCATCA TCAAGAACAC CTGCAGCGAG TACAAGGTTC CATACCTTGT 1320 

CAAGGATACG TTTTGGCAAG CATTTGCTTC ACATTTGGAG CACTTGCGTG TTCTTGGACT 1380 

45 CCGTCCCAAG GAAGAGTAGA AGAAAAAAAG CGCCGAATGA AGTATTGCCC CCTTTTTCTC 144 0 

CAAGAATGGC AAAAGGAGAT CAAGTGGACA TTCTCTATGA AGA 1483 
(2) INFORMATION FOR SEQ ID NO: 2: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

60 <xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Gly Thr Asp Gin Gly Lys Thr Phe Thr Trp Glu Glu Leu Ala Ala 
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15 10 15 

His Aan Thr Lys Asp Asp Leu Leu Leu Ala lie Arg Gly Arg Val Tyr 

20 25 30 

Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu 
35 40 45 

Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His 
50 55 60 

Ala Phe Gly Ala Ala Asp Ala lie Met Lys Lys Tyr Tyr Val Gly Thr 
65 70 75 80 

Leu Val Ser Asn Glu Leu Pro lie Phe Pro Glu Pro Thr Val Phe His 

85 90 95 

Lys Thr lie Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn lie 

100 105 110 

Asp Pro Lys Asn Arg Pro Glu lie Trp Gly Arg Tyr Ala Leu lie Phe 
115 120 125 

Gly Ser Leu lie Ala Ser Tyr Tyr Ala Gin Leu Phe Val Pro Phe Val. 
130 135 140 

Val Glu Arg Thr Trp Leu Gin Val Val Phe Ala lie lie Met Gly Phe 
145 150 155 160 

Ala Cys Ala Gin Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe 

165 170 175 

Ser Val Thr His Asn Pro Thr Val Trp Lys He Leu Gly Ala Thr His 

180 185 190 

Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gin His Met 
195 200 205 

Leu Gly His His Pro Tyr Thr Asn He Ala Gly Ala Asp Pro Asp Val 
210 215 220 

Ser Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp 
225 230 235 240 

Phe Val Asn His He Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly 

245 250 255 

Leu Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe 

260 265 270 

Val Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His 
275 280 285 

Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu 
290 295 300 

He Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe 
305 310 315 320 

Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin 

325 330 335 
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Ala Asn His Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn 

340 345 350 

Gly He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin 
355 360 365 

Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu 
370 375 3B0 

Asn Tyr Gin Ala Val His His Leu Phe Pro Asn Val Ser Gin His His 
385 390 395 400 

Tyr Pro Asp He Leu Ala He He Lys Asn Thr Cys Ser Glu Tyr Lye 

405 410 415 

Val Pro Tyr Leu Val Lys Asp Thr Phe Trp Gin Ala Phe Ala Ser Hie 

420 425 430 

Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu 
435 440 445 

INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQ17ENCE DESCRIPTION: SEQ ID N0:3: 

Leu His His Thr Tyr Thr Asn He Ala Gly Ala Asp Pro Asp Val Ser 
15 10 15 

Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp Phe 

20 25 30 

Val Asn His He Aen Gin His Met Phe Val Pro Phe Leu Tyr Gly Leu 
35 40 45 

Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe Val 
50 55 60 

Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His Thr 
65 70 75 80 

Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu He 

65 90 95 

Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe Thr 

100 105 110 

Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin Ala 
115 120 125 

Asn Tyr Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn Gly 
130 135 140 

He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin Asp 
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145 150 155 160 

Tyr Ala Hi© Asp Ser His Leu Trp Thr Ser lie Thr Gly Ser Leu Asn 

165 170 175 

Tyr Gin Xaa Val His His Leu Phe Pro His 

180 185 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
15 10 IS 

25 Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 

20 25 30 



Leu Met lie He Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
35 40 45 

Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 



Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
35 65 70 75 80 

Ala Asn Phe Tyr Val Gly Asp He Asp Glu Ser Asp Arg Asp He Lys 

85 90 95 

40 Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 

100 105 110 



Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

Ser Phe Asn Leu Cys He Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 



Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
50 145 150 155 160 

Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His 

165 170 175 

55 His Gin Val Phe Gin Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 

180 185 190 



Leu Gly Gly Val Cys Gin Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

His Asn Thr His His Ala Ala Pro Asn Val His Val Glu Asp Pro Asp 
210 215 220 
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He Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 

245 250 255 

Met Val Leu Asn Gin Thr Trp Phe Tyr Phe Pro He Leu Ser Phe Ala 

260 265 270 

Arg Leu Ser Trp Cys Leu Gin Ser He Leu Phe Val Leu Pro Asn Gly 
275 280 285 



Gin Ala His Lys Pro Ser Gly Ala Arg Val Pro He Ser Leu Val Glu 
15 290 295 300 

Gin Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305 310 315 320 

20 Leu Phe He Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 

325 330 335 



Gin Ala Val Cys Gly Asn Leu Leu Ala He Val Phe Ser Leu Asn His 

340 345 350 

Asn Gly Met Pro Val He Ser Lys Glu Glu Ala Val Asp Met Asp Phe 

355 360 365 



Phe Thr Lys Gin He He Thr Gly Arg Asp Val His Pro Gly Leu Phe 
30 370 375 380 

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gin He Glu His His Leu 
385 390 395 400 

35 Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys He Gin Pro Ala Val 

405 410 415 



Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 

420 425 430 



He Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
435 440 445 

45 Ala Ala Ser Lys Met Gly Lys Ala Gin 

450 455 

(2) INFORMATION FOR SEQ ID NO: 5: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) I>ENGTH: 446 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Gin He Lye Lys Tyr He Thr Ser Asp Glu Leu Lys Asn 
15 10 15 
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His Asp Lys Pro Gly Asp Leu Trp lie Ser lie Gin Gly Lys Ala Tyr 

20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
35 40 45 

Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 
50 55 60 

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 
65 70 75 80 



15 



Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Val Tyr Arg Lys Leu 

85 90 95 



20 



Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His lie 

100 105 110 

Met Phe Ala Thr Leu Cys Phe He Ala Met Leu Phe Ala Met Ser Val 
.115 120 125 



25 



Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cys Leu Met Gly Phe Leu Trp He Gin Ser Gly Trp He Gly His Asp 

145 150 155 160 



30 



Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 

165 170 175 



35 



Gly He Phe Ala Ala Asn Cys Leu Ser Gly He Ser He Gly Trp Trp 

180 185 190 

Lys Trp Asn His Asn Ala His His He Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 



40 



Asp Pro Asp Leu Gin Tyr He Pro Phe Leu Val Val Ser Ser Lys Phe 

210 215 220 

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lye Arg Leu Thr Phe Asp 

225 230 235 240 



45 



Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 

245 250 255 



50 



He Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu He Met 

260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 



55 



Cys Leu Val Phe Ser He Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg He Met Phe Val He Ala Ser Leu Ser Val Thr 
305 310 315 320 



60 



Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 

325 330 335 



Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
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340 345 350 

Gly Thr Leu Asp lie Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

5 

Gly Leu Gin Phe Gin lie Glu His Hie Leu Phe Pro Lys Met Pro Arg 
370 375 380 

Cys Asn Leu Arg Lys lie Ser Pro Tyr Val lie Glu Leu Cye Lys Lys 
10 385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 

405 410 415 

15 Thr Leu Arg Thr Leu Arg Aen Thr Ala Leu Gin Ala Arg Asp He Thr 

420 425 430 



20 



30 



40 



45 



50 



55 



60 



Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Leu Thr Ala Glu Arg He Lye Phe Thr Gin Lys Arg Gly Phe Arg 
35 1 5 10 15 



Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 

20 25 30 

Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu He lie Val 
35 40 45 

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 
50 55 60 

Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 
65 70 75 80 

Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 

85 90 95 

Ser Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 

100 105 110 

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
115 120 125 

Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Gly 
130 135 140 

Ala Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
145 150 155 160 
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Gin Gin Phe Tyr lie Trp Gly Leu Tyr Leu Phe lie Pro Phe Tyr Trp 

165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 

160 185 190 

His Lys lie Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 
195 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 
210 215 220 



Gly Phe Ser He Pro Glu Val Leu He Gly Ala Ser Val Thr Tyr Met 

15 225 230 235 240 

Thr Tyr Gly He Val Val Cys Thr He Phe Met Leu Ala His Val Leu 

245 250 255 

20 Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 

260 265 270 



Asp Glu Trp Ala He Cys Gin He Arg Thr Thr Ala Asn Phe Ala Thr 
275 280 285 

Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 
290 295 300 



Thr His His Leu Phe Pro Asn He Cys His He His Tyr Pro Gin Leu 
30 305 310 315 320 



Glu Asn He He Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 

325 330 335 

Val Tyr Pro Thr Phe Lys Ala Ala He Ala Ser Asn Tyr Arg Trp Leu 

340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 

(2) INFORMATION FOR SEQ ID NO: 7; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 65 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Thr Ser Thr Thr Ser Lys Val Thr Phe Gly Lys Ser He Gly Phe 
15 10 15 

Arg Lys Glu Leu Asn Arg Arg Val Asn Ala Tyr Leu Glu Ala Glu Asn 

20 25 30 

He Ser Pro Arg Asp Asn Pro Pro Met Tyr Leu Lys Thr Ala He He 
35 40 45 
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30 
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Leu Ala Trp Val Val Ser Ala Trp Thr Phe Val Val Phe Gly Pro Asp 
50 55 60 

Val Leu Trp Met Lys Leu Leu Gly Cys He Val Leu Gly Phe Gly Val 
65 70 75 60 

Ser Ala Val Gly Phe Asn He Ser His Asp Gly Asn His Gly Gly Tyr 

85 90 95 

Ser Lys Tyr Gin Trp Val Asn Tyr Leu Ser Gly Leu Thr His Asp Ala 

100 105 110 

He Gly Val Ser Ser Tyr Leu Trp Lye Phe Arg His Asn Val Leu His 
115 120 125 

His Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp 
130 135 140 



Glu Leu Val Arg Met Ser Pro Ser Met Glu Tyr Arg Trp Tyr His Arg 
20 145 150 155 160 



Tyr Gin His Trp Phe He Trp Phe Val Tyr Pro Phe He Pro Tyr Tyr 

165 170 175 

Trp Ser He Ala Asp Val Gin Thr Met Leu Phe Lys Arg Gin Tyr His 

180 les 190 

Asp His Glu He Pro Ser Pro Thr Trp Val Asp He Ala Thr Leu Leu 
195 200 205 

Ala Phe Lys Ala Phe Gly Val Ala Val Phe Leu He He Pro He Ala 
210 215 220 

« 

Val Gly Tyr Ser Pro Leu Glu Ala Val He Gly Ala Ser He Val Tyr 
225 230 235 240 

Met Thr His Gly Leu Val Ala Cys Val Val Phe Met Leu Ala His Val 

245 250 255 

He Glu Pro Ala Glu Phe Leu Asp Pro Asp Asn Leu His He Asp Asp 

260 265 270 

Glu Trp Ala He Ala Gin Val Lys Thr Thr Val Asp Phe Ala Pro Asn 
275 280 285 

Asn Thr He He Asn Trp Tyr Val Gly Gly Leu Asn Tyr Gin Thr Val 
290 295 300 



His His Leu Phe Pro His He Cys His He His Tyr Pro Lys He Ala 
50 305 310 315 320 



Pro He Leu Ala Glu Val Cys Glu Glu Phe Gly Val Asn Tyr Ala Val 

325 330 335 

His Gin Thr Phe Phe Gly Ala Leu Ala Ala Asn Tyr Ser Trp Leu Lys 

340 345 350 

Lys Met Ser He Asn Pro Glu Thr Lys Ala He Glu Gin 
355 360 365 

(2) INFORMATION FOR SEQ ID NO: 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 



(ix) FEATURE: 

<A) NAME/KEY: misc^f eature 
(B) LOCATION: 21 

(D) OTHER INFORMATION: /number* 1 
15 /notes "Nsinoeine or Cytosine" 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 27 

20 (D) OTHER INFORMATION: /number^ 2 

/note= "N-Inosine or Cytosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CUACUACUAC UACAYCAYAC NTAYACNAAY AT 32 
(2) INFORMATION FOR SEQ ID NO: 9: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Synthetic oligonucleotide" 



40 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /nuinber= 1 
/notes "N^Inosine or Cytosine" 

45 

(ix) FEATURE: 

(A) NAME/KEY: misc_fcature 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /number- 2 
50 /notes "Neinosine or Cytosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
55 CAUCAUCAUC AUNGGRAANA RRTGRTG 

(2) INFORMATION FOR SEQ ID NO: 10: 



27 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other nucleic acid 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCAAGCTTCT GCAGGAGCTC tTTTTTTTTT ttTTT 35 
10 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

IS (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



20 



25 



35 



45 



55 



60 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

His Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



Gin Xaa Xaa His His 
40 1 5 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 746 nucleic acids 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: not relevant 
50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTATCATAA ATATCAACAC ATTTACGCTC 60 

CACrrCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA 120 

AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT 180 

ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA 240 

GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA 300 

CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGATCTCAA ATTCTTTGCT ACCCCTGAAA 360 

GACCAGATGA ACCATCTCAA ATCAATGAAG ATTGGGCAAT CCTTCAACTT AAAACTACTC 420 
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AAGATTATGG TCATGGTTCA CTCCTTTGTA 
TTGTTCATCA TTTATTCCCA TCAATTGCTC 
TAAAAGAAGT TTGTAAAGAA CATAACATTA 
CTATTATGTC ACACATTAAT TACCTTTACA 
AACCATTAGC CTCAAAAGAT GATTAAATGA 
ACAAACAGTA ATATTAATAA ATACAA 



CCTTTTTTAG TGGTTCTTTA AATCT^TCAAG 
AAGATTTCTA CCCACAACTT GTACCAATTG 
CTTACCACAT TAAACCAAAC TTCACTGAAG 
AAATGGGTAA TGATCCAGAT TATGTTAT^ 
AATAACTTAA AAACCAATTA TTTACTTTTG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Tyr 


Val 


Thr 


Pro 


Phe 


Gin Thr Arg 


Ser 


Trp Tyr His 


Lys Tyr Gin 


1 








5 










10 




15 


His 


He 


Tyr 


Ala 


Pro 


Leu 


Leu 


Tyr 


Gly 


He Tyr Thr 


Leu Lys Tyr 










20 










25 




30 


Arg 


Thr 


Gin 


Asp 


Trp 


Glu 


Ala 


Phe 


Val 


Lys Asp Gly Lys Asn Gly 










2S 










40 




45 


Ala 


He 


Arg 


Val 


Ser 


Val 


Ala 


Thr 


Asn 


Phe Asp Lys Ala Ala Tyr 










50 










55 




60 


Val 


He 


Gly 


Lys 


Leu 


Ser 


Phe 


Val 


Phe 


Phe Arg 


Phe 


He Leu Pro 










65 










70 




75 


Leu 


Arg 


Tyr 


His 


Ser 


Phe 


Thr Asp 


Leu 


He Cys 


Tyr 


Phe Leu He 










80 










85 




90 


Ala 


Glu 


Phe 


Val 


Phe 


Gly Trp Tyr 


Leu 


Thr He 


Asn 


Phe Gin Val 










95 










100 




105 


Ser 


His 


Val 


Ala 


Glu 


Asp 


Leu 


Lys 


Phe 


Phe Ala 


Thr 


Pro Glu Arg 










110 










115 




120 


Pro Asp Glu 


Pro 


Ser 


Gin 


He 


Asn 


Glu 


Asp Trp 


Ala 


He Leu Gin 










125 










130 




135 


Leu 


Lys 


Thr 


Thr 


Gin 


Asp 


Tyr 


Gly 


His 


Gly Ser 


Leu 


Leu Cys Thr 










140 










145 




150 


Phe 


Phe 


Ser 


Gly 


Ser 


Leu 


Asn 


His 


Gin 


Val Val 


His 


His Leu Phe 










155 










160 




165 


Pro 


Ser 


He 


Ala 


Gin 


Asp 


Phe 


Tyr 


Pro 


Gin Leu 


Val 


Pro He Val 










170 










175 




180 


Lys 


Glu 


Val 


Cys 


Lys 


Glu 


His 


Asn 


He 


Thr Tyr 


His 


He Lys Pro 










185 










190 




195 


Asn 


Phe 


Thr 


Glu 


Ala 


He 


Met 


Ser 


His 


He Asn Tyr Leu Tyr Lys 










200 










205 




210 


Met 


Gly Asn 


Asp 


Pro 


Asp 


Tyr 


Val 


Lys 


Lys Pro 


Leu 


Ala Ser Lys 










215 










220 




225 



Asp Asp ♦** 



<2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 nucleic acids 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS: not relevant 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 



103 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



TTTTGGAAGG NTCCAAGTTN ACCACGGANT 
CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT 
TTATTCCCCA GCCTGCCCCG ACACAATCTG 
TGCAAGGAGT GGGGTGTCCA GTACCACGAA 
TTGCACCATT T6GGCAGCGT GGCCGGCGAA 
GCCATGTAAT CGTCGTTCGT GACGATGCAA 
ACACAACTAG TGTAACTCGT ATAGAATTCG 
GGGATAGGGT AGGTAGGCGG ACGCGTGGGT 
GCCCGCGTNA AAGT 



NGGCAAGTTN ACGGGGCGGA AANCGGTTTT 
GGTGGCTTCC AGTACCAAGT CGACCACCAC 
GCCAAGACAC ACGCACTGGT CGAATCGTTC 
GCCGACCTCG TGGACGGGAC CATGGAAGTC 
TTCGTCGTGG ATTTTGTACG CGACGGACCC 
GGGTTCACGC ACATCTACAC ACACTCACTC 
GTGTCGACCT GGACCTTGTT TGACTGGTTG 
CGNCCCCGGG AATTCTGTGA CCGGTACCTG 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16; 



Phe Trp 


Lys 


Xxx 


Pro 


Ser 


Xxx 


Pro 


Arg 


Xxx 


Xxx 


Gin 


Val 


Xxx Gly 


1 






5 










10 








15 


Ala Glu 


Xxx 


Gly 


Phe 
20 


Pro 


Pro 


Lys 


Pro 


Phe 

25 


Val 


Asp 


Trp 


Phe Cys 
30 


Gly Gly 


Phe 


Gin 


Tyr 

35 


Gin 


Val 


Asp 


His 


His 
40 


Leu 


Phe 


Pro 


Ser Leu 
45 


Pro Arg 


His 


Asn 


Leu 
50 


Ala 


Lys 


Thr 


His 


Ala 

55 


Leu 


Val 


Glu 


Ser Phe 
60 


Cys Lys 


Glu 


Trp 


Gly 
65 


Val 


Gin 


Tyr 


His 


Glu 
70 


Ala 


Asp 


Leu 


Val Asp 
75 


Gly Thr 


Met 


Glu 


Val 
65 


Leu 


His 


His 


Leu 


Gly 
70 


Ser 


Val 


Ala 


Gly Glu 
75 


Phe Val 


Val 


Asp 


Phe 
80 


Val 


Arg 


Asp 


Gly 


Pro 
85 


Ala 


Met 







(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 nucleic acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GGATGGAGTT CGTCTGGATC 
CTTGGGTACA CGCCGGGGCA 
ATTTACATTT TTCTGCAGTT 
GATCAGCTGC ATTGGCTCGA 



GCTGTGCGCT ACGCGACGTG 
GTCGTTGGGC ATGTACTTGT 
CGCCGTAAGT CACACCCATT 
GTACGCGCGG ACCACACTGT 

-104- 



GTTTAAGCGT CATGGGTGCG 
GCGCCTTTGG TCTCGGCTGC 
TGCCCGTGAG CAACCCGGAG 
GAACATCAGC ACCAAGTCGT 



GGTTTGTCAC ATGGTGGATG 
CGGCGCCCCA GTTCCGTTTC 
ACGGTCTCCC TTACTACGAC 
TCTACTCCGT CGGCCATTCC 
TTAATTCCCC ACCCCACCCC 



TCGAACCTCA ACTTTCAGAT 
AAGGAGATCA GCCCGCGCGT 
ATGCCCTACA CGAGCGCCGT 
GTCGGCGACG CCAAGCGCGA 
ATGTTCTGTC TTCCTCCCGC 



CGAGCACCAC CTTTTCCCCA 
CGAGGCCCTC TTCAAGCGCC 
CTCCACCACC TTTGCCAACC 
CTAGCCTCTT TTCCTAGACC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Met Glu Phe 
1 

Arg His Gly 
Val Leu Val 
Val Arg Arg 
Ser Ala Ala 
Ser Thr Lys 
Phe Gin He 
Phe Lys Glu 
Gly Leu Pro 
Thr Phe Ala 
Lys Arg Asp 



Val Trp He 
5 

Cys Ala Trp 
20 

Arg Leu Trp 
35 

Lys Ser His 
50 

Leu Ala Arg 
65 

Ser Trp Phe 
80 

Glu His His 
95 

He Ser Pro 

110 
Tyr Tyr Asp 

125 
Asn Leu Tyr 

140 



Ala Val Arg 
Val His Ala 
Ser Arg Leu 
Pro Phe Ala 
Val Arg Ala 
Val Thr Tarp 
Leu Phe Pro 
Arg Val Glu 
Met Pro Tyr 
Ser Val Gly 



Tyr Ala Thr 
10 

Gly Ala Val 
25 

His Leu His 
40 

Arg Glu Gin 
55 

Asp His Thr 
70 

Trp Met Ser 
85 

Thr Ala Pro 
100 

Ala Leu Phe 
115 

Thr Ser Ala 
130 

His Ser Val 
145 



Trp Phe Lys 
15 

Val Gly His 
30 

Phe Ser Ala 
45 

Pro Gly Gly 
60 

Val Asn He 
75 

Asn Leu Asn 
90 

Gin Phe Arg 
105 

Lys Arg His 
120 

Val Ser Thr 
135 

Gly Asp Ala 

150 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



ACGCGTCCGC CCACGCGTCC 
GCTCCCGCAC ATGACGTACC 
GCTGATGGGT CAGTCTTCAC 
TCAGGGTCGC TGCGGCTGGG 
TTGGCTTGAC GACCGGTTGT 



GCCGCGAGCA ACTCATCAAG 
GCGTGGTCGA GATTGTTGTT 
CCCTCGCGCT CGCTCTCGGC 
TAATGCATGA GATGGGCCAT 
GCGAGTTCTT TTACGGCGTT 



GAAGGCTACT TTGACCCCTC 
CTCTTCGTGC TTTCCTTTTG 
ATTGTCGTCA GCGGCATCTC 
GGGTCGTTCA CTGGTGTCAT 
GGTTGTGGCA TGAGCGGTCA 



105 
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15 



20 



25 



30 



35 



50 



55 



60 



TTACTGGAAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 360 
AGATCTCMC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 420 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 20 



Arg Val 


Arg 


Pro 


Arg 


Val 


Arg 


Arg 


Glu 


Gin 


Leu 


lie 


Lys 


Glu 


Gly 


1 






5 










10 










15 


Tyr Phe 


Asp 


Pro 


Ser 
20 


Leu 


Pro 


His 


Met 


Thr 
25 


Tyr 


Arg 


Val 


Val 


Glu 
30 


lie Val 


Val 


Leu 


Phe 
35 


Val 


Leu 


Ser 


Phe 


Trp 
40 


Leu 


Met 


Gly 


Gin 


Ser 
45 


Ser Pro 


Leu 


Ala 


Leu 


Ala 


Leu 


Gly 


lie 


Val 


Val 


Ser 


Gly 


He 


Ser 








50 










55 








60 


Gin Gly 


Arg 


Cye 


Gly 


Trp 


Val 


Met 


His 


Glu 


Met 


Gly 


His 


Gly 


Ser 








65 










70 








75 


Phe Thr 


Gly 


Val 


lie 


Trp 


Leu 


Asp 


Asp Arg 


Leu 


Cys 


Glu 


Phe 


Phe 








65 










70 










75 


Tyr Gly 


Val 


Gly 


Cys 


Gly 


Met 


Ser 


Gly His 


Tyr 


Trp 


Lys 


Asn 


Gin 








80 










85 










90 


His Ser 


Lys 


His 


His 


Ala 


Ala 


Pro 


Asn 


Arg 


Leu 


Glu 


His 


Asp 


Val 








95 










100 








105 


Asp Leu 


Asn 


Thr 


Leu 


Pro 


Leu 


Val 


Ala 


Phe 


Asn 


Glu 


Arg 


Val 


Val 








110 










lis 








120 


Arg Lys 


Val 


Arg 


Pro 
125 























40 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2692004) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GCACGCCGAC CGGCGCCGGG AGATCCTGGC AAAGTATCCA GAGATAAAGT CCTTGATGAA 60 

ACCTGATCCC AATTTGATAT GGATTATAAT TATGATGGTT CTCACCCAGT TGGGTGCATT 120 

TTACATAGTA AAAGACTTGG ACTGGAAATG GGTCATATTT GGGGCCTATG CGTTTGGCAG 180 

TTGCATTAAC CACTCAATGA CTCTGGCTAT TCATGAGATT GCCCACAATG CTGCCTTTGG 240 

CAACTGCAAA GCAATGTGGA ATCGCTGGTT TGGAATGTTT GCTAATCTTC CTATTGGGAT 300 



-106- 
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TCCATATTCA 


ATTTCCTTTA 


AGAGGTATCA 


CATGGATCAT 




TT ^ P 21 r"TT' a 
J. i UO/^VsC 1 Ls/l 


JoU 




TGGCGTCGAT 


GTAGATATTC 


CTACCGATTT 






V^L^VjU ill ^/io 


/ion 


5 


AAAGTTTATA 


TGGGTTATTC 


TTCAGCCTCT 


CTTTTATGCr 




loll ^/i 1 


4 OU 




CCCCAAACCA 


ATTACGTATC 


TGGAAGTTAT 


CAATACCGTG 




^ 1 I 1 1 VsH^n i 


0 4 U 


10 


TTTAATTTAT 


TACTTTTTGG 


GAATTAAATC 


CTTAGTCTAC 


ATGTTGGCAG 


CATTTTTAPT 

\^r\ X W X X X n W 1 




TGGCCTGGGT 


TTGCACCCAA 


TTTCTGGACA 


TTTTATAGCT 


GAGCATTACA 


TGTTPTTZl Aa 


oou 




GGGTCATGAA 


ACTTACTCAT 


ATTATGGGCC 


TCTGAATTTA 






T 0 A 


15 


TCATAATGAA 


CATCATGATT 


TCCCCAACAT 


TCCTGGAAAA 


«>wX V« X A ^^/IW 




# oU 




AATAGCAGCT 


GAATACTATG 


ACAACCTCCC 


TCACTACAAT 






fi ^ o 


20 


TGATTTTGTG 


ATGGATGATA 


CAATAAGTCC 


CTACTCAAGA 


ATGAAGAGGP 






AGAGATGGTG 


CTGGAGTAAA 


TATCATTAGT 


GCCAAAGGGA 


TTCTTCTCCA 


AAACTTTAGA 


960 




TGATAAAATG 


GAATTTTTGC 


ATTATTAAAC 


TTGAGACCAG 


TGATGCTCAG 


AAGCTCCCCT 


1020 


25 


GGCACAATTT 


CAGAGTAAGA 


GCTCGGTGAT 


ACCAAGAAGT 


GAATCTGGCT 


TTTAAACAGT 


1080 




CAGCCTGACT 


CTGTACTGCT 


CAGTTTCACT 


CACAGGAAAC 


TTGTGACTTG 


TGTATTATCG 


1140 


30 


TCATTGAGGA 


TGTTTCACTC 


ATGTCTGTCA 


TTTTATAAGC 


ATATCATTTA 


AAAAGCTTCT 


1200 


AAAAAGCTAT 


TTCGCCAGG 










1219 



35 



55 



65 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2153526) 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

TTACCTTCTA CGTCCGCTTC TTCCTCACTT ATGTGCCACT ATTGGGGCTG AAAGCTTCCT 60 

50 GGGCCTTTTC TTCATAGTCA GGTTCCTGGA AAGCAACTGG TTTGTGTGGG TGACACAGAT 120 

GAACCATATT CCCATGCACA TTGATCATGA CCGGAACATG GACTGGGTTT CCACCCAGCT 180 

CCAGGCCACA TGCAATGTCC ACAAGTCTGC CTTCAATGAC TGGTTCAGTG GACACCTCAA 240 

CTTCCAGATT GAGCACCATC TTTTTCCCAC GATGCCTCGA CACAATTACC ACAAAGTGGC 300 

TCCCCTGGTG CAGTCCTTGT GTGCCAAGCA TGGCATAGAG TACCAGTCCA AGCCCCTGCT 360 

60 GTCAGCCTTC GCCGACATCA TCCACTCACT AAAGGAGTCA GGGCAGCTCT GGCTAGATGC 420 

CTATCTTCAC CAATAACAAC AGCCACCCTG CCCAGTCTGG AAGAAGAGGA GGAAGACTCT 480 

GGAGCCAAGG CAGAGGGGAG CTTGAGGGAC AATGCCACTA TAGTTTAATA CTCAGAGGGG 540 



-107- 
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GTTGGGTTTG GGGACATAAA GCCTCTGACT CAAACTCCTC CCTTTTATCT TCTAGCCACA 600 
GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 655 

5 

(2) INFORMATIOK FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60 

20 TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120 

CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180 

AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240 

CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300 
AAGA 



25 



30 



40 



304 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854 933) 





(xi) SEQUENCE DESCRIPTION: 


SEQ ID NO 


:24: 








CAGGGACCTA 


CCCCGCGCTA 


CTTCACCTGG 


GACGAGGTGG 


CCCAGCGCTC 


AGGGTGCGAG 


60 


45 


GAGCGGTGGC 


TAGTGATCGA 


CCGTAAGGTG 


TACAACATCA 


GCGAGTTCAC 


CCGCCGGCAT 


120 




CCAGGGGGCT 


CCCGGGTCAT 


CAGCCACTAC 


GCCGGGCAGG 


ATGCCACGGA 


TCCCTTTGTG 


180 


50 


GCCTTCCACA 


TCAACAAGGG 


CCTTGTGAAG 


AAGTATATGA 


ACTCTCTCCT 


GATTGGAGAA 


240 


CTGTCTCCAG 


AGCAGCCCAG 


CTTTGAGCCC 


ACCAAGAATA 


AAGAGCTGAC 


AGATGAGTTC 


300 




CGGGAGCTGC 


GGGCCACAGT 


GGAGCGGATG 


GGGCTCATGA 


AGGCCAACCA 


TGTCTTCTTC 


360 


55 


CTGCTGTACC 


TGCTGCACAT 


CTTGCTGCTG 


GATGGTGCAG 


CCTGGCTCAC 


CCTTTGGGTC 


420 




TTTGGGACGT 


CCTTTTTGCC 


CTTCCTCCTC 


TGTGCGGTGC 


TGCTCAGTGC 


AGTTCAGGCC 


480 


60 


CAGGCTGGCT 


GGCTGCAGCA 


TGACTTTGGG 


CACCTGTCGG 


TCTTCAGCAC 


CTCAAAGTGG 


540 




AACCATCTGC 


TACATCATTT 


TGTGATTGGC 


CACCTGAAGG 


GGGCCCCCGC 


CAGTTGGTGG 


600 




AACCACATGC 


ACTTCCAGCA 


CCATGCCAAG 


CCCAACTGCT 


TCCGCAAAGA 


CCCAGACATC 


660 
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AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 
AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 
5 GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 

TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 
ACCGCAAATG CTTCTAAA 

10 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE; other nucleic acid (Edited Contig 2511785) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



25 


GCCACTTAAA 


GGGTGCCTCT 


GCCAACTGGT 


GGAATCATCG 


CCACTTCCAG 


CACCACGCCA 


' 60 




AGCCTAACAT 


CTTCCACAAG 


GATCCCGATG 


TGAACATGCT 


GCACGTGTTT 


GTTCTGGGCG 


120 


30 


AATGGCAGCC 


CATCGAGTAC 


GGCAAGAAGA 


AGCTGAAATA 


CCTGCCCTAC 


AATCACCAGC 


180 


ACGAATACTT 


CTTCCTGATT 


GGGCCGCCGC 


TGCTCATCCC 


CATGTATTTC 


CAGTACCAGA 


240 




TCATCATGAC 


CATGATCGTC 


CATAAGAACT 


GGGTGGACCT 


GGCCTGGGCC 


GTCAGCTACT 


300 


35 


ACATCCGGTT 


CTTCATCACC 


TACATCCCTT 


TCTACGGCAT 


CCTGGGAGCC 


CTCCTTTTCC 


360 




TCAACTTCAT 


CAGGTTCCTG 


GAGAGCCACT 


GGTTTGTGTG 


GGTCACACAG 


ATGAATCACA 


420 


40 


TCGTCATGGA 


GATTGACCAG 


GAGGCCTACC 


GTGACTGGTT 


CAGTAGCCAG 


CTGACAGCCA 


480 


CCTGCAACGT 


GGAGCAGTCC 


TTCTTCAACG 


ACTGGTTCAG 


TGGACACCTT 


AACTTCCAGA 


540 




TTGAGCACCA 


CCTCTTCCCC 


ACCATGCCCC 


GGCACAACTT 


ACACAAGATC 


GCCCCGCTGG 


600 


45 


TGAAGTCTCT 


ATGTGCCAAG 


CATGGCATTG 


AATACCAGGA 


GAAGCCGCTA 


CTGAGGGCCC 


660 




TGCTGGACAT 


CATCAGGTCC 


CTGAAGAAGT 


CTG6GAAGCT 


GTGGCTGGAC 


GCCTACCTTC 


720 


50 


ACAAATGAAG 


CCACAGCCCC 


CGGGACACCG 


TGGGGAAGGG 


GTGCAGGTGG 


GGTGATGGCC 


780 


AGAGGAATGA 


TGGGCTTTTG 


TTCTGAGGGG 


TGTCCGAGAG 


GCTGGTGTAT 


GCACTGCTCA 


840 




CGGACCCCAT 


GTTGGATCTT 


TCTCCCTTTC 


TCCTCTCCTT 


TTTCTCTTCA 


CATCTCCCCC 


900 


55 


ATAGCACCCT 


GCCCTCATGG 


GACCTGCCCT 


CCCTCAGCCG 


TCAGCCATCA 


GCCATGGCCC 


960 




TCCCAGTGCC 


TCCTAGCCCC 


TTCTTCCAAG 


GAGCAGAGAG 


GTGGCCACCG 


GGGGTGGCTC 


1020 


60 


TGTCCTACCT 


CCACTCTCTG 


CCCCTAAAGA 


TGGGAGGAGA 


CCAGCGGTCC 


ATGGGTCTGG 


1080 


CCTGTGAGTC 


TCCCCTTGCA 


GCCTGGTCAC 


TAGGCATCAC 


CCCCGCTTTG 


GTTCTTCAGA 


1140 




TGCTCTTGGG 


GTTCATAGGG 


GCAGGTCCTA 


GTCGGGCAGG 


GCCCCTGACC 


CTCCCGGCCT 


1200 


65 


GGCTTCACTC 


TCCCTGACGG 


CTGCCATTGG 


TCCACCCTTT 


CATAGAGAGG 


CCTGCTTTGT 


1260 
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10 



15 



20 



30 



TACAAAGCTC GGGTCTCCCT CCTGCAGCTC GGTTAAGTAC CCGAGGCCTC TCTTAAGATG 1320 

TCCAGGGCCC CAGGCCCGCG GGCACAGCCA GCCCAAACCT TGGGCCCTGG AAGAGTCCTC 1380 

CACCCCATCA CTAGAGTGCT CTGACCCTGG GCTTTCACGG GCCCCATTCC ACC6CCTCCC 1440 

CAACTTGAGC CTGTGACCTT GGGACCAAAG GGGGAGTCCC TCGTCTCTTG TGACTCAGCA 1500 

GAGGCAGTGG CCACGTTCAG GGAGGGGCCG GCTGGCCTGG AGGCTCAGCC CACCCTCCAG 1560 

CTTTTCCTCA GGGTGTCCTG AGGTCCAAGA TTCTGGAGCA ATCTGACCCT TCTCCAAAGG 1620 

CTCTGTTATC AGCTGGGCAG TGCCAGCCAA TCCCTGGCCA TTTGGCCCCA GGGGACGTGG 1680 
GCCCTG 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID Nd:26: 



1686 





GTCTTTTACT 


TTGGCAATGG 


CTGGATTCCT 


ACCCTCATCA 


CGGCCTTTGT 


CCTTGCTACC 


60 


35 


TCTCAGGCCC 


AAGCTGGATG 


GCTGCAACAT 


GATTATGGCC 


ACCTGTCTGT 


CTACAGAAAA 


120 




CCCAAGTGGA 


ACCACCTTGT 


CCACAAATTC 


GTCATTGGCC 


ACTTAAAGGG 


TGCCTCTGCC 


180 




AACTGGTGGA 


ATCATCGCCA 


CTTCCAGCAC 


CACGCCAAGC 


CTAACATCTT 


CCACAAGGAT 


240 


40 


CCCGATGTGA 


ACATGCTGCA 


CGTGTTTGTT 


CTGGGCGAAT 


GGCAGCCCAT 


CGAGTACGGC 


300 




AAGAAGAAGC 


TGAAATACCT 


GCCCTACAAT 


CACCAGCACG 


AATACTTCTT 


CCTGATTGGG 


360 


45 


CCGCCGCTGC 


TCATCCCCAT 


GTATTTCCAG 


TACCAGATCA 


TCATGACCAT 


GATCGTCCAT 


420 


AAGAACTGGG 


TGGACCTGGC 


CTGGGCCGTC 


AGCTACTACA 


TCCGGTTCTT 


CATCACCTAC 


480 




ATCCCTTTCT 


ACGGCATCCT 


GGGAGCCCTC 


CTTTTCCTCA 


ACTTCATCAG 


GTTCCTGGAG 


540 


50 


AGCCACTGGT 


TTGTGTGGGT 


CACACAGATG 


AATCACATCG 


TCATGGAGAT 


TGACCAGGAG 


600 




GCCTACCGTG 


ACTGGTTCAG 


TAGCCAGCTG 


ACAGCCACCT 


GCAACGTGGA 


GCAGTCCTTC 


660 


55 


TTCAACGACT 


GGTTCAGTGG 


ACACCTTAAC 


TTCCAGATTG 


AGCACCACCT 


CTTCCCCACC 


720 




ATGCCCCGGC 


ACAACTTACA 


CAAGATCGCC 


CCGCTGGTGA 


AGTCTCTATG 


TGCCAAGCAT 


780 




GGCATTGAAT 


ACCAGGAGAA 


GCCGCTACTG 


AGGGCCCTGC 


TGGACATCAT 


CAGGTCCCTG 


840 


60 


AAGAAGTCTG 


GGAAGCTGTG 


GCTGGACGCC 


TACCTTCACA 


AATGAAGCCA 


CAGCCCCCGG 


900 




GACACCGTGG 


GGAAGGGGTG 


CAGGTGGGGT 


GATGGCCAGA 


GG7UVTGATGG 


GCTTTTGTTC 


960 




TGAGGGGTGT 


CCGAGAGGCT 


GGTGTATGCA 


CTGCTCACGG 


ACCCCATGTT 


GGATCTTTCT 


1020 
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v.'^V^ i 1 i i \^\^ 


1 TTT 


CTCTTCACAT 


CTCCCCCATA 


GCACCCTGCC 


CTCATGGGAC 


1080 




^ I 1 ^L.^ 


1 k-AGCCGTCA 


GCCATCAGCC 


ATGGCCCTCC 


CAGTGCCTCC 


TAGCCCCTTC 


1140 


5 




L.At3 AvjACaGT G 


GCCACCGGGG 


GTGGCTCTGT 


CCTACCTCCA 


CTCTCTGCCC 


1200 








GCGGTCCATG 


GGTCTGGCCT 


GTGAGTCTCC 


CCTTGCAGCC 


1260 


10 




oCAJ. CA^LLL 


CGCTTTGGTT 


CTTCAGATGC 


TCTTGGGGTT 


CATAGGGGCA 


1320 


1 1 1 \^ 


(suuLAGGGCC 


CCTGACCCTC 


CCGGCCTGGC 


TTCACTCTCC 


CTGACGGCTG 


1380 






ACCCTTTCAT 


AGAGAGGCCT 


GCTTTGTTAC 


AAAGCTCGGG 


TCTCCCTCCT 


1440 




T^.c f^f cr^ 


TAAGTACCCG 


AGGCCTCTCT 


TAAGATGTCC 


AGGGCCCCAG 


GCCCGCG6GC 


1500 




ACAGCCAGCC 


CAAACCTTGG 


GCCCTGGAAG 


AGTCCTCCAC 


CCCATCACTA 


GAGTGCTCTG 


1560 


20 


ACCCTGGGCT 


TTCACGGGCC 


CCATTCCACC 


GCCTCCCCAA 


CTTGAGCCTG 


TGACCTTGGG 


1620 


ACCAAAGGGG 


GAGTCCCTCG 


TCTCTTGTGA 


CTCAGCAGAG 


GCAGTGGCCA 


CGTTCAGGGA 


1680 




GGGGCCGGCT 


GGCCTGGAGG 


CTCAGCCCAC 


CCTCCAGCTT 


TTCCTCAGGG 


TGTCCTGAGG 


1740 


25 


TCC/^GATTC 


TGGAGCAATC 


TGACCCTTCT 


CCAAAGGCTC 


TGTTATCAGC 


TGGGCAGTGC 


1800 




CAGCCAATCC 


CTGGCCATTT 


GGCCCCAGGG 


GACGTGGGCC 


CTG 




1843 



30 (2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2257 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a) 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 





CAGGGACCTA 


CCCCGCGCTA 


CTTCACCTGG 


GACGAGGTGG 


CCCAGCGCTC 


AGGGTGCGAG 


60 


45 


GAGCG6TGGC 


TAGTGATCGA 


CCGTAAGGTG 


TACAACATCA 


GCGAGTTCAC 


CCGCCGGCAT 


120 


CCAGGGGGCT 


CCCGGGTCAT 


CAGCCACTAC 


GCCGGGCAGG 


ATGCCACGGA 


TCCCTTTGTG 


180 




GCCTTCCACA 


TCAACAAGGG 


CCTTGTGAAG 


AAGTATATGA 


ACTCTCTCCT 


GATTGGAGAA 


240 


50 


CTGTCTCCAG 


AGCAGCCCAG 


CTTTGAGCCC 


ACCAAGAATA 


AAGAGCTGAC 


AGATGAGTTC 


300 




CGGGAGCTGC 


GGGCCACAGT 


GGAGCGGATG 


GGGCTCATGA 


AGGCCAACCA 


TGTCTTCTTC 


360 


55 


CTGCTGTACC 


TGCTGCACAT 


CTTGCTGCTG 


GATGGTGCAG 


CCTGGCTCAC 


CCTTTGGGTC 


420 


TTTGGGACGT 


CCTTTTTGCC 


CTTCCTCCTC 


TGTGCGGTGC 


TGCTCAGTGC 


AGTTCAGCAG 


480 




GCCCAAGCTG 


GATGGCTGCA 


ACATGATTAT 


GGCCACCTGT 


CTGTCTACAG 


AAAACCCAAG 


540 


60 


TGGAACCACC 


TTGTCCACAA 


ATTCGTCATT 


GGCCACTTAA 


AGGGTGCCTC 


TGCCAACTGG 


600 




TGGAATCATC 


GCCACTTCCA 


GCACCACGCC 


AAGCCTAACA 


TCTTCCACAA 


GGATCCCGAT 


660 




GTGAACATGC 


TGCACGTGTT 


TGTTCTGGGC 


GAATGGCAGC 


CCATCGAGTA 


CGGCAAGAAG 


720 
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AAGCTGAAAT 


ACCTGCCCTA 


CAATCACCAG 


CACGAATACT 


TCTTCCTGAT 


TGGGCCGCCG 


780 




CTGCTCATCC 


CCATGTATTT 


CCAGTACCAG 


ATCATCATGA 


CCATGATCGT 


CCATAAGAAC 


840 


c 

J 


TGGGTGGACC 


TGGCCTGGGC 


CGTCAGCTAC 


TACATCCGGT 


TCTTCATCAC 


CTACATCCCT 


900 




TTCTACGGCA 


TCCTGGGAGC 


CCTCCTTTTC 


CTCAACTTCA 


TCAGGTTCCT 


GGAGAGCCAC 


960 


10 


TGGTTTGTGT 


GGGTCACACA 


GATGAATCAC 


ATCGTCATGG 


AGATTGACCA 


GGAGGCCTAC 


1020 


CGTGACTGGT 


TCAGTAGCCA 


GCTGACAGCC 


ACCTGCAACG 


TGGAGCAGTC 


CTTCTTCAAC 


1080 




GACTGGTTCA 


GTGGACACCT 


TAACTTCCAG 


ATTGAGCACC 


ACCTCTTCCC 


CACCATGCCC 


1140 


1 c 
ID 


CGGCACAACT 


TACACAAGAT 


CGCCCCGCTG 


GTGAAGTCTC 


TATGTGCCAA 


GCATGGCATT 


1200 




GAATACCAGG 


AGAAGCCGCT 


ACTGAGGGCC 


CTGCTGGACA 


TCATCAGGTC 


CCTGAAGAAG 


1260 


20 


TCTGGGAAGC 


TGTGGCTGGA 


CGCCTACCTT 


CACAAATGAA 


GCCACAGCCC 


CCGGGACACC 


1320 


GTGGGGAAGG 


GGTGCAGGTG 


GGGTGATGGC 


CAGAGGAATG 


ATGGGCTTTT 


GTTCTGAGGG 


1380 




GTGTCCGAGA 


GGCTGGTGTA 


TGCACTGCTC 


ACGGACCCCA 


TGTTGGATCT 


TTCTCCCTTT 


1440 




CTCCTCTCCT 


TTTTCTCTTC 


ACATCTCCCC 


CATAGCACCC 


TGCCCTCATG 


GGACCTGCCC 


1500 




TCCCTCAGCC 


GTCAGCCATC 


AGCCATGGCC 


CTCCCAGTGC 


CTCCTAGCCC 


CTTCTTCCAA 


1560 


30 


GGAGCAGAGA 


GGTGGCCACC 


GGGGGTGGCT 


CTGTCCTACC 


TCCACTCTCT 


GCCCCTAAAG 


1620 


ATGGGAGGAG 


ACCAGCGGTC 


CATGGGTCTG 


GCCTGTGAGT 


CTCCCCTTGC 


AGCCTGGTCA 


1660 




CTAGGCATCA 


CCCCCGCTTT 


GGTTCTTCAG 


ATGCTCTTGG 


GGTTCATAGG 


GGCAGGTCCT 


1740 




AGTCGGGCAG 


GGCCCCTGAC 


CCTCCCGGCC 


TGGCTTCACT 


CTCCCTGACG 


GCTGCCATTG 


1800 




GTCCACCCTT 


TCATAGAGAG 


GCCTGCTTTG 


TTACAAAGCT 


CGGGTCTCCC 


TCCTGCAGCT 


1860 


40 


CGGTTAAGTA 


CCCGAGGCCT 


CTCTTAAGAT 


GTCCAGGGCC 


CCAGGCCCGC 


GGGCACAGCC 


1920 




AGCCCAAACC 


TTGGGCCCTG 


GAAGAGTCCT 


CCACCCCATC 


ACTAGAGTGC 


TCTGACCCTG 


1980 




GGCTTTCACG 


GGCCCCATTC 


CACCGCCTCC 


CCAACTTGAG 


CCTGTGACCT 


TGGGACCAAA 


2040 


45 


GGGGGAGTCC 


CTCGTCTCTT 


GTGACTCAGC 


AGAGGCAGTG 


GCCACGTTCA 


GGGAGGGGCC 


2100 




GGCTGGCCTG 


GAGGCTCAGC 


CCACCCTCCA 


GCTTTTCCTC 


AGGGTGTCCT 


GAGGTCCAAG 


2160 


50 


ATTCTGGAGC 


AATCTGACCC 


TTCTCCAAAG 


GCTCTGTTAT 


CAGCTGGGCA 


GT6CCAGCCA 


2220 




ATCCCTGGCC 


ATTTGGCCCC 


AGGGGACGTG 


GGCCCTG 






2257 



55 



60 



65 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
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His 


Ala 


Asp 


Arg 


Arg 


Arg 


Glu 


He 


Leu 


Ala 


Lys 


Tyr 


Pro 


Glu 


He 


1 








5 










10 










15 


Lys 


Ser 


Leu 


Met 


Lys 


Pro 


Asp 


Pro 


Asn 


Leu 


He 


Trp 


He 

^ ^ 


He 


He 










20 










25 








30 


Met 


Met 


Val 


Leu 


Thr 


Gin 


Leu 


Gly 


Ala 


Phe 


Tyr 


He 


Val 


Lvs 


A *?r5 










35 










40 








45 


Leu 


Asp 


Trp 


Lys 


Trp 


Val 


He 


Phe 


Gly 


Ala 


Tyr 


Ala 


Phe 


Glv 












50 










55 








60 


Cys 


lie 


Asn 


His 


Ser 
65 


Met 


Thr 


Leu 


Ala 


He 
70 


His 


Glu 


He 


Ala 


His 
75 


Asn 


Ala 


Ala 


Phe 


Gly 


Asn 


Cys 


Lvs 


Ala 


Met 


Trp Asn 
















80 










85 








90 


Gly 


Met 


Phe 


Ala 


Asn 
95 


Leu 


Pro 


He 


Glv 


He 
100 




Tyr 


Ser 


He 


105 


Phe 


Lys 


Arg 


Tyr 


His 
110 


Met 


Asp 


His 


His 


Arc 
115 


Tyr 


Leu 


Glv 


Ala 


120 


Gly 


Val 


Asp 


Val 


Asp 


He 


Pro 


Thr 


ASD 


Phe 




Gly 


1 tp 


Phe 


Phe 










125 










130 








135 


Cys 


Thr 


Ala 


Phe 


Arg 
140 


Lys 


Phe 


He 


Trp 


Val 
145 


T 1 o 


Leu 


Gin 


Pro 


Leu 
150 


Phe 


Tyr 


Ala 


Phe 


Arg 


Pro 


Leu 


Phe 


He 


Asn 


D V* M 

fro 


Lys 




He 


Thr 










155 










160 








165 


Tyr 


Leu 


Glu 


Val 


He 


Asn 


Thr 


Val 


Ala 


Gin 


Val 


Thr 


Phe 


Asp 


He 










170 










175 








180 


Leu 


lie 


Tyr 


Tyr 


Phe 


Leu 


Gly 


He 


Lvs 


Ser 


Leu 


Val 


Tyr 


Met 


Leu 










185 










190 








195 


Ala 


Ala 


Ser 


Leu 


Leu 


Gly 


Leu 


Glv 


Leu 


His 


rro 


He 


Ser 


Gly His 










200 










205 










210 


Phe 


He 


Ala 


Glu 


His 
215 


Tyr 


Met 


Phe 


Leu 


Lvs 
220 


vji.y 


His 


Glu 


Thr 


Tyr 
225 


Ser 


Tyr 


Tyr 


Gly 


Pro 


Leu 


Asn 


Leu 


Leu 


Thr 


rne 


Asn 


Val 


Gly 


Tyr 










230 










235 

W w 








240 


His 


Asn 


Glu 


His 


His 


Asp 


Phe 


Pro 


Asn 


He 


Pro 


Gly Lys 


Ser 


Leu 










245 










250 










255 


Pro 


Leu 


Val 


Arg 


Lys 


He 


Ala 


Ala 


Glu 


Tvr 


Tyr 


Asp 


Asn 


Leu 


Pro 










260 










265 








270 


His 


Tyr 


Asn 


Ser 


Trp 
275 


He 


Lvs 


Val 




TvT 
280 


Asp 


Phe 


Val 


Met 


Asp 

285 


Asp 


Thr 


He 


Ser 


Pro 


Tvr 


Ser 


Ara 


Met 




Arg 


His 


Gin 


Lys 


Gly 










290 










295 








300 


Glu 


Met 


Val 


Leu 


Glu 


* * * 


He 


Ser 


Leu 


Val 


Pro 


Lys 


Gly 


Phe 


Phe 










305 










310 






315 


Ser 


Lys 


Thr 


Leu 


Asp 


Asp 


Lys 


Met 


Glu 


Phe 


Leu 


His 


Tyr 


<* * 4> 


Thr 










320 










325 








330 


*** 


Asp 


Gin 


* ♦ * 


Cys 
335 


Ser 


Glu 


Ala 


Pro 


Leu 
340 


Ala 


Gin 


Phe 


Gin 


Ser 
345 


Lys 


Ser 


Ser 


Val 


He 


Pro 


Arg 


Ser 


Glu 


Ser 


Gly 


Phe 


* * * 


Thr 


Val 










350 










355 








360 


Ser 


Leu 


Thr 


Leu 


Tyr 


Cys 


Ser 


Val 


Ser 


Leu 


Thr 


Gly Asn 


Leu 


*** 










365 










370 










375 


Leu 


Val 


Tyr 


Tyr 


Arg 


His 




Gly Cys 


Phe 


Thr 


His 


Val 


Cys 


His 










380 










385 








390 


Phe 


He 


Ser 


He 


Ser 


Phe 


Lys 


Lys 


Leu 


Leu 


Lys 


Ser 


Tyr 


Phe 


Ala 










400 










405 








410 



Arg 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 



113 
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15 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Tyr Leu Leu Arg Pro Leu Leu Pro His Leu Cys Ala Thr lie Gly 

10 1 5 10 15 

Ala Glu Ser Phe Leu Gly Leu Phe Phe lie Val Arg Phe Leu Glu 

20 25 30 

Ser Asn Trp Phe Val Trp Val Thr Gin Met Asn His He Pro Met 

35 40 45 

His He Asp His Asp Arg Asn Met Asp Trp Val Ser Thr Gin Leu 

50 55 60 

Gin Ala Thr Cys Asn Val His Lys Ser Ala Phe Asn Asp Trp Phe 

65 70 75 

Ser Gly His Leu Asn Phe Gin He Glu His His Leu Phe Pro Thr 

20 80 85 90 

Met Pro Arg His Asn Tyr His Lys Val Ala Pro Leu Val Gin Ser 

95 100 105 

Leu Cys Ala Lys His Gly He Glu Tyr Gin Ser Lys Pro Leu Leu 

110 115 120 

Ser Ala Phe Ala Asp He He His Ser Leu Lys Glu Ser Gly Gin 

125 130 135 

Leu Trp Leu Asp Ala Tyr Leu His Gin **♦ Gin Gin Pro Pro Cys " 

140 145" 150 

m ^^"^ 

•^0 155 160 165 

Gly Ala *** Gly Thr Met Pro Leu *** Phe Asn Thr Gin Arg Gly 

170 175 180 

Leu Gly Leu Gly Thr *** Ser Leu *** Leu Lys Leu Leu Pro Phe 

185 190 195 

He Phe *** Pro Gin Phe *♦* Asp Pro Lys Trp Gly Val Asp Thr 

200 205 210 
Glu Val Pro Arg Arg Glu Gly Ala 

215 



25 



35 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Val 


Phe 


Tyr 


Phe 


Gly Asn 


Gly 


Trp 


He 


Pro 


Thr 


Leu He 


Thr 


Ala 


1 








5 








10 








15 


Phe 


Val 


Leu 


Ala 


Thr Ser 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp Leu 


Gin 


His 










20 








25 






30 


Asp 


Tyr 


Gly 


His 


Leu Ser 


Val 


Tyr 


Arg 


Lys 


Pro 


Lys Trp 


Asn 


His 










35 








40 






45 


Leu 


Val 


His 


Lys 


Phe Val 


He 


Gly 


His 


Leu 


Lys 


Gly Ala 


Ser 


Ala 
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10 



20 



25 



30 



35 



40 



45 



50 



55 



60 



50 55 60 

Asn Trp Trp Asn His Arg His Phe Gin His His Ala Lys Pro Asn 

65 70 75 

Leu Gly Glu Trp Gin Pro lie Glu Tyr Gly Lys Xxx 

80 85 



(2) INFORMATION FOR SEQ ID N0:31: 



(i) SEQUENCE CHARACTERISTICS: 

iPi) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 



Gin 


Gly 


Pro 


Thr 


Pro 


Arg 


Tyr 


Phe 


Thr 


Trp 


Asp 


Glu 


Val 


Ala 


Gin 


1 








5 










10 










15 


Arg 


Ser 


Gly Cys 


Glu 


Glu 


Arg 


Trp 


Leu 


Val 


He 


Asp 


Arg 


Lys 


Val 




















25 










30 


Tyr 


Asn 


He 


Ser 


Glu 


Phe 


Thr 


Arg 


Arg 


His 


Pro 


Gly Gly 


Ser 


Arg 










35 










40 










45 


Val 


He 


Ser 


H^S 


Tyr 


Ala 


Gly 


Gin 


Asp 


Ala 


Thr 


Asp 


Pro 


Phe 


Val 










50 










55 










60 


Ala 


Phe 


His 


He 


Asn 


Lys 


Gly 


Leu 


Val 


Lys 


Lys 


Tyr 


Met 


Asn 


Ser 










65 










70 










75 


Leu 


Leu 


He 


Gly 


Glu 


Leu 


Ser 


Pro 


Glu 


Gin 


Pro 


Ser 


Phe 


Glu 


Pro 










80 










85 










90 


Thr 


Lys 


Asn 


Lys 


Glu 


Leu 


Thr 


Asp 


Glu 


Phe 


Arg 


Glu 


Leu 


Arg 


Ala 










95 










100 










105 


Thr 


Val 


Glu 


Arg 


Met 


Gly 


Leu 


Met 


Lys 


Ala 


Asn 


His 


Val 


Phe 


Phe 










110 










115 










120 


Leu 


Leu 


Tyr 


Leu 


Leu 


His 


He 


Leu 


Leu 


Leu 


Asp 


Gly 


Ala 


Ala 


Trp 










125 










130 










135 


Leu 


Thr 


Leu 


Trp 


Val 


Phe 


Gly 


Thr 


Ser 


Phe 


Leu 


Pro 


Phe 


Leu 


Leu 










140 










145 










150 


Cys 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Val 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp 


Leu 










155 










160 










165 


Gin 


His 


Asp 


Phe 


Gly 


His 


Leu 


Ser 


Val 


Phe 


Ser 


Thr 


Ser 


Lys 


Trp 










170 










175 










180 


Asn 


His 


Leu 


Leu 


His 


His 


Phe 


Val 


He 


Gly 


His 


Leu 


Lys 


Gly 


Ala 










185 










190 










195 


Pro 


Ala 


Ser 


Trp 


Trp 


Asn 


His 


Met 


His 


Phe 


Gin 


His 


His 


Ala 


Lys 










200 










205 










210 


Pro 


Asn 


Cys 


Phe 


Arg 


Lys 


Asp 


Pro 


Asp 


He 


Asn 


Met 


His 


Pro 


Phe 










215 










220 










225 


Phe 


Phe 


Ala 


Leu 


Gly 


Lys 


He 


Leu 


Ser 


Val 


Glu 


Leu 


Gly 


Lys 


Gin 










230 










235 






240 


Lys 


Lys 


Lys 


Tyr 


Met 


Pro 


Tyr 


Asn 


His 


Gin 


His 


Xxx 


Tyr 


Phe 


Phe 










245 










250 








255 


Leu 


He 


Gly 


Pro 


Pro 


Ala 


Leu 


Leu 


Pro 


Leu 


Tyr 


Phe 


Gin 


Trp 


Tyr 










260 










265 








270 


He 


Phe 


Tyr 


Phe 


Val 


He 


Gin 


Arg 


Lys 


Lys 


Trp 


Val 


Asp 


Leu 


Ala 










275 










280 










285 


Trp 


He 


Ser 


Lys 


Gin 


Glu 


Tyr 


Asp 


Glu 


Ala 


Gly 


Leu 


Pro 


Leu 


Ser 










290 










295 








300 


Thr 


Ala 


Asn 


Ala 


Ser 


Lys 





















115 



305 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



* * • 

His 


Leu 


Lys 


Gly 


Ala 


Ser 


Ala 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe 


1 








5 










10 










15 


Gin 


His 


His 


Ala 


Lys 


Pro 


Asn 


He 


Phe 


His 


Lys 


Asp 


Pro 


Asp 


Val 










20 










25 










30 


Asn 


Met 


Leu 


His 


Val 


Phe 


Val 


Leu 


Gly 


Glu 


Trp 


Gin 


Pro 


He 


Glu 










35 










40 










45 


Tyr 


Gly 


Lys 


Lys 


Lys 


Leu 


Lys 


Tyr 


Leu 


Pro 


Tyr 


Asn 


His 


Gin 


His 










50 










55 










60 


Glu 


Tyr 


Phe 


Phe 


Leu 


He 


Gly 


Pro 


Pro 


Leu 


Leu 


He 


Pro 


Met 


Tyr 










65 










70 










75 


Phe 


Gin 


Tyr 


Gin 


lie 


He 


Met 


Thr 


Met 


He 


Val 


His 


Lys 


Asn 


Trp 










60 










85 








90 


Val 


Asp 


Leu 


Ala 


Trp 


Ala 


Val 


Ser 


Tyr 


Tyr 


He 


Arg 


Phe 


Phe 


He 










95 










100 










105 


Thr 


Tyr 


lie 


Pro 


Phe 


Tyr 


Gly 


He 


Leu 


Gly 


Ala 


Leu 


Leu 


Phe 


Leu 










110 










115 










120 


Asn 


Phe 


lie 


Arg 


Phe 


Leu 


Glu 


Ser 


His 


Trp 


Phe 


Val 


Trp 


Val 


Thr 










125 










130 








135 


Gin 


Met 


Asn 


His 


He 


Val 


Met 


Glu 


He 


Asp 


Gin 


Glu 


Ala 


Tyr 


Arg 










140 










145 








150 


Asp 


Trp 


Phe 


Ser 


Ser 


Gin 


Leu 


Thr 


Ala 


Thr 


Cys 


Asn 


Val 


Glu 


Gin 










155 










160 










165 


Ser 


Phe 


Phe 


Asn 


Asp 


Trp 


Phe 


Ser 


Gly 


His 


Leu 


Asn 


Phe 


Gin 


He 










170 










175 










180 


Glu 


His 


His 


Leu 


Phe 


Pro 


Thr 


Met 


Pro 


Arg 


His 


Asn 


Leu 


His 


Lys 










185 










190 










195 


lie 


Ala 


Pro 


Leu 


Val 


Lys 


Ser 


Leu 


Cys 


Ala 


Lys 


His 


Gly 


He 


Glu 










200 










205 






210 


Tyr 


Gin 


Glu 


Lys 


Pro 


Leu 


Leu 


Arg 


Ala 


Leu 


Leu 


Asp 


He 


He 


Arg 










215 










220 








225 


Ser 


Leu 


Lys 


Lys 


Ser 


Gly 


Lys 


Leu 


Trp 


Leu 


Asp 


Ala 


Tyr 


Leu 


His 










230 










235 






240 


Lys 




Ser 


His 


Ser 


Pro 


Arg 


Asp 


Thr 


Val 


Gly 


Lys 


Gly 


Cys 


Arg 










245 










250 










255 


Trp 


Gly 


Asp 


Gly 


Gin 


Arg 


Asn 


Asp 


Gly 


Leu 


Leu 


Phe 


*** 


Gly 


Val 










260 










265 








270 


Ser 


Glu 


Arg 


Leu 


Val 


Tyr 


Ala 


Leu 


Leu 


Thr 


Asp 


Pro 


Met 


Leu 


Asp 










275 










280 








285 


Leu 


Ser 


Pro 


Phe 


Leu 


Leu 


Ser 


Phe 


Phe 


Ser 


Ser 


His 


Leu 


Pro 


His 










290 










295 










300 


Ser 


Thr 


Leu 


Pro 


Ser 


Trp 


Asp 


Leu 


Pro 


Ser 


Leu 


Ser 


Arg 


Gin 


Pro 










305 










310 








315 


Ser 


Ala 


Met 


Ala 


Leu 


Pro 


Val 


Pro 


Pro 


Ser 


Pro 


Phe 


Phe 


Gin 


Gly 


Ala 








320 










325 










330 


Glu 


Arg 


Trp 


Pro 


Pro 


Gly 


Val 


Ala 


Leu 


Ser 


Tyr 


Leu 


His 


Ser 










335 










340 








345 
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Leu 


Pro 


Leu 


Lys 


Met 
350 


Gly 


Gly 




Cys 


Glu 


Ser 


Pro 


Leu 
365 


Ala 


Ala 


5 


Leu 


Val 


Leu 


Gin 


Met 
380 


Leu 


Leu 




Arg 


Ala 


Gly 


Pro 


Leu 
400 


Thr 


Leu 




Ara 


Leu 


Pro 


Leu 


Val 


His 


Pro 


10 








415 








Gin 


Ser 


Ser 


Gly 


Leu 
430 


Pro 


Pro 




Leu 


Ser 


* 4r * 


Asp 


Val 


Gin 


Gly 


15 










445 






Pro 


Asn 


Leu 


Gly 


Pro 
460 


Trp 


Lys 




Ala 


Leu 


Thr 


Leu 


Gly 
475 


Phe 


His 




Thr 




Ala 


Cvs 


ASD 


Leu 


Glv 


20 
















Leu 




Leu 


Ser 


Arg 
505 


Gly 


Ser 




Trp 


Pro 


Gly 


Gly 


Ser 


Ala 


His 


25 










520 






Leu 


Arg 


Ser 


Lys 


He 
535 


Leu 


Glu 




Leu 


Leu 


Ser 


Ala 


Gly 
550 


Gin 


Cys 


30 


Pro 


Gly 


Asp 


Val 


Gly 


Pro 


Xxx 










565 







Asp 


Gin 


Arg 
355 


Ser 


Met 


Gly 


Leu 


Ala 
360 


Trp 


Ser 


Leu 
370 


Gly 


He 


Thr 


Pro 


Ala 
375 


Gly 


Phe 


He 


Gly Ala 


Gly 


Pro 


Ser 






385 










390 

^ ^0 


Pro 


Ala 


Trp 
405 


Leu 


His 


Ser 


Pro 


♦ * ♦ 

410 


Phe 


He 


Glu 
420 


Arg 


Pro 


Ala 


Leu 


Leu 
425 


Ala 


Ala 


Ara 
435 


Leu 


Ser 


Thr 


Arg 


Gly 
440 


Pro 


Arg 


Pro 
450 


Ala 


Glv 


Thr 


Ala 


Ser 
455 


Ser 


Pro 


Pro 
465 


Pro 


His 


His 


« '*"* 


Ser 
470 


Gly 


Pro 


His 
4 fl n 

1 o u 


Ser 


Thr 


Ala 


Ser 


Pro 

*i O J 


Thr 


Lys 


Gly 
495 


Gly 


Val 


Pro 


Arg 


Leu 
500 


Gly 


His 


Val 


Gin 


Gly 


Gly Ala 


Gly 






510 










515 


Pro 


Pro 


Ala 
525 


Phe 


Pro 


Gin 


Gly 


Val 
530 


Gin 


Ser 


Asp 
540 


Pro 


Ser 


Pro 


Lys 


Ala 
545 


Gin 


Pro 


He 
555 


Pro 


Gly 


His 


Leu 


Ala 

560 



(2) INFORMATION FOR SEQ ID NO: 33: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

45 





Val 


Phe 


Tyr 


Phe 


Gly 


Asn 


Gly 


Trp 


He 


Pro 


Thr 


Leu 


He 


Thr 


Ala 




1 








5 










10 










15 


50 


Phe 


Val 


Leu 


Ala 


Thr 


Ser 


Gin 


Ala 


Gin 


Ala 


Gly Trp 


Leu 


Gin 


His 










20 










25 










30 




Asp 


Tyr 


Gly 


His 


Leu 
35 


Ser 


Val 


Tyr 


Arg 


Lys 
40 


Pro 


Lys 


Trp 


Asn 


His 
45 




Leu 


Val 


His 


Lys 


Phe 


Val 


He 


Gly 


His 


Leu 


Lys 


Gly 


Ala 


Ser 


Ala 


55 










50 










55 










60 


Asn 


Trp 


Trp 


Asn 


His 
65 


Arg 


His 


Phe 


Gin 


His 
70 


His 


Ala 


Lys 


Pro 


Asn 
75 




He 


Phe 


His 


Lys 


Asp 
80 


Pro 


Asp 


Val 


Asn 


Met 
85 


Leu 


His 


Val 


Phe 


Val 
90 


60 


Leu 


Gly 


Glu 


Trp 


Gin 


Pro 


He 


Glu 


Tyr 


Gly 


Lys 


Lys 


Lys 


Leu 


Lys 










95 










100 










105 




Tyr 


Leu 


Pro 


Tyr 


Asn 
110 


His 


Gin 


His 


Glu 


Tyr 
115 


Phe 


Phe 


Leu 


He 


Gly 
120 
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Pro Pro Leu 
Thr Met He 
Ser Tyr Tyr 
He Leu Gly 
Ser His Trp 
Glu He Asp 
Thr Ala Thr 
Ser Gly His 
Met Pro Arg 
Leu Cys Ala 
Arg Ala Leu 
Leu Trp Leu 
Asp Thr Val 
Asp Gly Leu 
Leu Leu Thr 
Phe Phe Ser 
Leu Pro Ser 
Pro Pro Ser 
Val Ala Leu 
Asp Gin Arg 
Trp Ser Leu 
Gly Phe He 
Pro Ala Trp 
Phe He Glu 
Ala Ala Arg 
Pro Arg Pro 
Ser Pro Pro 
Gly Pro His 
Thr Lye Gly 
Gly His Val 
Pro Pro Ala 



Leu 


He 


Pro 




125 




Val 


His 


Lys 




140 




He 


Arg 


Phe 




155 




Ala 


Leu 


Leu 




* mm 

170 




Phe 


Val 


Trp 




185 




Gin 


Glu 


Ala 




200 




Cys 


Asn 


Val 




215 




Leu 


Asn 


Phe 




230 




His 


Asn 


Leu 




245 




Lys 


His 


Gly 




260 




Leu 


Asp 


He 




275 




Asp 


Ala 


Tyr 




290 




Gly 


Lys 


Gly 




305 




Leu 


Phe 


** * 




320 




Asp 


Pro 


Met 




335 




Ser 


His 


Leu 




350 




Leu 


Ser 


Arg 




365 




Pro 


Phe 


Phe 




380 




Ser 


Tyr 


Leu 




400 




Ser 


Met 


Gly 




415 




Gly 


He 


Thr 




430 




Gly 


Ala 


Gly 




445 




Leu 


His 


Ser 




460 




Arg 


Pro 


Ala 




475 




Leu 


Ser 


Thr 




490 




Ala 


Gly 


Thr 




505 




Pro 


His 


His 




520 




Ser 


Thr 


Ala 




535 




Gly 


Val 


Pro 




550 




Gin 


Gly 


Gly 




565 




Phe 


Pro 


Gin 




580 





Met Tyr Phe 
Asn Trp Val 
Phe He Thr 
Phe Leu Asn 
Val Thr Gin 
Tyr Arg Asp 
Glu Gin Ser 
Gin He Glu 
His Lys He 
He Glu Tyr 
He Arg Ser 
Leu His Lys 
Cys Arg Trp 
Gly Val Ser 
Leu Asp Leu 
Pro His Ser 
Gin Pro Ser 
Gin Gly Ala 
His Ser Leu 
Leu Ala Cys 
Pro Ala Leu 
Pro Ser Arg 
Pro Arg 
Leu Leu Gin 
Arg Gly Leu 
Ala Ser Pro 
*** Ser Ala 
Ser Pro Thr 
Arg Leu Leu 
Ala Gly Trp 
Gly Val Leu 
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Gin 


Tyr 


Gin 


130 






Asp 


Leu 


Ala 


145 






Tyr 


He 


Pro 


160 






Phe 


He 


Arg 


175 






Met 


Asn 


His 


190 






Trp 


Phe 


Ser 


205 






Phe 


Phe 


Asn 


220 






His 


His 


Leu 


235 






Ala 


Pro 


Leu 


250 






Gin 


Glu 


Lys 


265 






Leu 


Lys 


Lys 


280 






* * * 


Ser 


His 


295 






Gly 


Asp 


Gly 


310 






Glu 


Arg 


Leu 


325 


- 




Ser 


Pro 


Phe 


340 






Thr 


Leu 


Pro 


355 






Ala 


Met 


Ala 


370 






Glu 


Arg 


Trp 


385 






Pro 


Leu 


Lys 


405 






Glu 


Ser 


Pro 


420 






Val 


Leu 


Gin 


435 






Ala 


Gly 


Pro 


450 






Leu 


Pro 


Leu 


465 






Ser 


Ser 


Gly 


480 






Ser 


* ** 


Asp 


495 






Asn 


Leu 


Gly 


510 






Leu 


Thr 


Leu 


525 






* ★ * 


Ala 


Cys 


540 








Leu 


Ser 


555 






Pro 


Gly 


Gly 


570 






Arg 


Ser 


Lys 


585 







He 


He 


Met 






135 


Trp 


Ala 


Val 






150 


Phe 


Tyr 


Gly 






165 


Phe 


Leu 


Glu 






180 


He 


Val 


Met 






195 


Ser 


Gin 


Leu 






210 


Asp 


Trp 


Phe 






225 


Phe 


Pro 


Thr 






240 


Val 


Lys 


Ser 






255 


Pro 


Leu 


Leu 






270 


Ser 


Gly 


Lys 






285 


Ser 


Pro 


Arg 






300 


Gin 


Arg 


Asn 






315 


Val 


Tyr 


Ala 






330 


Leu 


Leu 


Ser 






345 


Ser 


Trp 


Asp 






360 


Leu 


Pro 


Val 






375 


Pro 


Pro 


Gly 






390 


Met 


Gly 


Gly 






410 


Leu 


Ala 


Ala 






425 


Met 


Leu 


Leu 






440 


Leu 


Thr 


Leu 






455 


Val 


His 


Pro 






470 


Leu 


Pro 


Pro 






485 


Val 


Gin 


Gly 






500 


Pro 


Trp 


Lys 






515 


Gly 


Phe 


His 






530 


Asp 


Leu 


Gly 






545 


Arg 


Gly 


Ser 






560 


Ser 


Ala 


His 






575 


He 


Leu 


Glu 






590 



Gin Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala Gly Gin Cys 

595 600 605 

Gin Pro lie Pro Gly His Leu Ala Pro Gly Asp Val Gly Pro Xxx 

610 615 620 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 253538a) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



Gin Gly Pro 
1 

Arg Ser Gly 
Tyr Asn lie 
Val He Ser 
Ala Phe His 
Leu Leu He 
Thr Lys Asn 
Thr Val Glu 
Leu Leu Tyr 
Leu Thr Leu 
Cys Ala Val 
Leu Gin His 
Trp Asn His 
Ala Ser Ala 
Lys Pro Asn 
Val Phe Val 
Lys Leu Lys 
Leu He Gly 
He He Met 
Trp Ala Val 
Phe Tyr Gly 



Thr Pro Arg 
5 

Cys Glu Glu 
20 

Ser Glu Phe 
35 

His Tyr Ala 
50 

He Asn Lys 
65 

Gly Glu Leu 
80 

Lys Glu Leu 
95 

Arg Met Gly 

110 
Leu Leu His 

125 
Trp Val Phe 

140 
Leu Leu Ser 

155 
Asp Tyr Gly 

170 
Leu Val His 

185 
Asn Trp Trp 

200 
He Phe His 

215 
Leu Gly Glu 

230 
Tyr Leu Pro 

245 
Pro Pro Leu 

260 
Thr Met He 

275 
Ser Tyr Tyr 

290 
He Leu Gly 

305 



Tyr Phe Thr 
Arg Trp Leu 
Thr Arg Arg 
Gly Gin Asp 
Gly Leu Val 
Ser Pro Glu 
Thr Asp Glu 
Leu Met Lys 
He Leu Leu 
Gly Thr Ser 
Ala Val Gin 
His Leu Ser 
Lys Phe Val 
Asn His Arg 
Lys Asp Pro 
Trp Gin Pro 
Tyr Asn His 
Leu He Pro 
Val His Lys 
He Arg Phe 
Ala Leu Leu 



Trp 


Asp 


Glu 


10 






Val 


He 


Asp 


25 






His 


Pro 


Gly 


40 






Ala 


Thr 


Asp 


55 






Lys 


Lys 


Tyr 


70 






Gin 


Pro 


Ser 


85 






Phe 


Arg 


Glu 


100 






Ala 


Asn 


His 


115 






Leu Asp Gly 


130 






Phe 


Leu 


Pro 


145 






Gin 


Ala 


Gin 


160 






Val 


Tyr 


Arg 


175 






He 


Gly 


His 


190 






His 


Phe 


Gin 


205 






Asp 


Val 


Asn 


220 






He 


Glu 


Tyr 


235 






Gin 


His 


Glu 


250 






Met 


Tyr 


Phe 


265 






Asn 


Trp 


Val 


280 






Phe 


He 


Thr 


295 






Phe 


Leu 


Asn 


310 







Val Ala Gin 
15 

Arg Lys Val 
30 

Gly Ser Arg 
45 

Pro Phe Val 
60 

Met Asn Ser 
75 

Phe Glu Pro 
90 

Leu Arg Ala 
105 

Val Phe Phe 
120 

Ala Ala Trp 
135 

Phe Leu Leu 
150 

Ala Gly Trp 
165 

Lys Pro Lys 
180 

Leu Lys Gly 
195 

His His Ala 
210 

Met Leu His 
225 

Gly Lys Lys 
240 

Tyr Phe Phe 

255 

Gin Tyr Gin 
270 

Asp Leu Ala 
285 

Tyr He Pro 
300 

Phe He Arg 
315 
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Phe 


Leu 


Glu 


Ser 


His 

<J ^ V/ 


He 


Val 


Met 


Glu 


He 
«^ J «j 


Ser 


Gin 


Leu 


Thr 


Ala 
so 

•J U 


Asp 


Trp 


Phe 


Ser 


Gly 

w ^ 


Phe 


Pro 


Thr 


Met 


Pro 

■J o u 


Val 


Lys 


Ser 


Leu 


Cys 


Pro 


Leu 


Leu 


Arg 


Ala 

1 J. □ 


Ser 


Gly 


Lys 


Leu 


Trp 


Ser 


Pro 


Arg 


Asp 


Thr 

4 4 ^ 

H H 3 


Gin 


Arg 


Asn 


Asp 


Gly 


Val 


Tyr 


Ala 


Leu 


Leu 

t t ^ 


Leu 


Leu 


Ser 


Phe 


Phe 


Ser 


Trp 


Asp 


Leu 


Pro 


Leu 


Pro 


Val 


Pro 


Pro 


Pro 


Pro 


Gly 


Val 


Ala 


Met 


Gly 


Gly 


Asp 


Gin 

DOM 


Leu 


Ala 


Ala 


Trp 


Ser 

o OS 


Met 


Leu 


Leu 


Gly 


Phe 


Leu 


Thr 


Leu 


Pro 


Ala 


Val 


His 


Pro 


Phe 


lie 
o ± u 


Leu 


Pro 


Pro 


Ala 


Ala 


Val 


Gin 


Gly 


Pro 


Arg 


Pro 


Trp 


Lys 


Ser 


Pro 


Gly 


Phe 


His 


Gly 


Pro 

D r U 




J>i w u 






iajfS 

685 


Arg 


Gly 


Ser 


Gly 


His 
700 


Ser 


Ala 


His 


Pro 


Pro 
715 


He 


Leu 


Glu 


Gin 


Ser 
730 


Gly 


Gin 


Cys 


Gin 


Pro 
745 


Gly 


Pro 


Xxx 







Trp 


Phe 


Val 


Trp 


Val 

"3 O 


Asp 


Gin 


Glu 


Ala 


Tyr 
o4 0 


Thr 


Cys 


Asn 


Val 


Glu 

"3 C C 


His 


Leu 


Asn 


Phe 


Gin 
"in fx 


Arg 


His 


Asn 


Leu 


His 

o o c 


Ala 


Lys 


His 


Gly 


He 
40b 


Leu 


Leu 


Asp 


He 


He 

42 0 


Leu 


Asp 


Ala 


Tyr 


Leu 
435 


Val 


Gly 


Lys 


Gly 


Cys 

A c r\ 

450 


Leu 


Leu 


Phe 




Gly 

4 15 


Thr 


Asp 


Pro 


Met 


Leu 

A O f\ 

480 


Ser 


Ser 


His 


Leu 


Pro 

Ana 

49b 


Ser 


Leu 


Ser 


Arg 


Gin 
510 


Ser 


Pro 


Phe 


Phe 


Gin 

c o c 

52 5 


Leu 


Ser 


Tyr 


Leu 


His 
540 


Arg 


Ser 


Met 


Gly 


Leu 

c e e 

555 


Leu 


Gly 


He 


Thr 


Pro 

C T 

570 


He 


Gly 


Ala 


Gly 


Pro 

e A c 

585 


Trp 


Leu 


His 


Ser 


Pro 
600 


Glu 


Arg 


Pro 


Ala 


Leu 
615 


Arg 


Leu 


Ser 


Thr 


Arg 
630 


Pro 


Ala 


Gly 


Thr 


Ala 
D4b 


Pro 


Pro 


His 


His 


*** 

ooO 


His 


Ser 


Thr 


Ala 


Ser 
675 


Gly 


Gly 


Val 


Pro 


Arg 

690 


Val 


Gin 


Gly 


Gly 


Ala 
705 


Ala 


Phe 


Pro 


Gin 


Gly 
720 


Asp 


Pro 


Ser 


Pro 


Lys 
735 


He 


Pro 


Gly 


His 


Leu 
750 
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Thr Gin Met Asn His 

330 

Arg Asp Trp Phe Ser 

345 

Gin Ser Phe Phe Asn 

360 

He Glu His His Leu 

375 

Lys He Ala Pro Leu 

390 

Glu Tyr Gin Glu Lys 

410 

Arg Ser Leu Lys Lys 

425 

His Lys *** Ser His 

440 

Arg Trp Gly Asp Gly 

455 

Val Ser Glu Arg Leu 

470 

Asp Leu Ser Pro Phe 

485 

His Ser Thr Leu Pro 

500 

Pro Ser Ala Met Ala 

515 

Gly Ala Glu Arg Trp 

530 

Ser Leu Pro Leu Lys 

545 

Ala Cys Glu Ser Pro 

560 

Ala Leu Val Leu Gin 

575 

Ser Arg Ala Gly Pro 

590 

*** Arg Leu Pro Leu 

605 

Leu Gin Ser Ser Gly 

620 

Gly Leu Ser *** Asp 

635 

Ser Pro Asn Leu Gly 

650 

Ser Ala Leu Thr Leu 

665 

Pro Thr Ala Cys 

680 

Leu Leu *** Leu Ser 

695 

Gly Trp Pro Gly Gly 

710 

Val Leu Arg Ser Lys 

725 

Ala Leu Leu Ser Ala 

740 

Ala Pro Gly Asp Val 

755 
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What is claimed is : 

1 . An isolated nucleic acid comprising: 

a nucleotide sequence depicted in a SEQ ID NO. 1 

2. A polypeptide encoded by said nucleic acid of claim 1 . 

5 3. A purified or isolated polypeptide comprising an amino acid 

sequence depicted in SEQ ID NO: 2. 

4. An isolated nucleic acid encoding the polypeptide of SEQ ID 

NO: 2. 

5. An isolated nucleic acid comprising: 

1 0 a nucleotide sequence which encodes a polypeptide that desaturates a 

fatty acid molecule at carbon 5 from the carboxyl end of said fatty acid 
molecule. 

6. The isoloated nucleic acid according to Claim S, wherein said 
nucleotide sequence is derived from eukaryotic cell. 

15 7. The isolated nucleic acid according to Claim 6, wherein said 

eukaryotic cell is a fungal cell. 

8. The isolated nucleic acid according to Claim 7, wherein said 
fungal cell is of the genus Mortierella. 

9. The isolated nucleic acid according to Claim 8, wherein said 
20 Mortierella cell is of the species Mortierella alpina. 

10. The isolated nucleic acid according to Claim wherein said 
nucleotide sequence anneals to a nucleotide sequence depicted in SEQ ID 
NO: 1. 

1 1 . The nucleic acid of claim 10, wherein said nucleotide sequence 
25 encodes an amino acid sequence depicted in SEQ ID NO: 2. 

12. The nucleic acid of claim 11, wherein said amino acid sequence 
depicted in SEQ ID NO: 2 is selected from the group consisting of amino acid 
residues 30-38, 41-44, 171-175, 203-212, and 387-394. 
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13. An isolated or purified polypeptide which desaturates a fatty acid 
molecule at carbon 5 from the carboxyl end of said fatty acid molecule. 

14. An isolated nucleic acid comprising: 

a nucleotide sequence which is substantially identical to a sequence of at 
5 least 50 nucleotides in SEQ ID NO 1 . 

1 5. An isolated nucleic acid sequence having at least about 50% 
identity to SEQ ID NO 1 . 

16. A nucleic acid construct comprising: 

a nucleotide sequence depicted in a SEQ ID NO: 1 linked to a 
1 0 heterologous nucleic acid. 

17. A nucleic acid construct comprising: 

a nucleotide sequence depicted in a SEQ ID NO: 1 operably linked to a 
promoter. 

18. The nucleic acid construct of claim 17, wherein said promoter is 
1 5 functional in a microbial cell. 

1 9. The nucleic acid construct of claim 1 8, wherein said microbial 
cell is a yeast cell. 

20. The nucleic acid construct of claim 17, wherein said nucleotide 
sequence is derived from a fungus. 

20 21. The nucleic acid according to Claim 1 9, wherein said fungus is 

of the genus Mortierella. 

22. The nucleic acid according to Claim 20, wherein said fungus is 
of the species Morlierella alpina, 

23. A nucleic acid construct comprising: 

25 a nucleotide sequence which encodes a polypeptide comprising an amino acid 

sequence which corresponds to or is complementary to an amino acid sequence 
depicted in SEQ ID NO: 2, wherein said nucleotide sequence is operably linked 
to a promoter which is functional in a host cell, wherein said nucleotide 
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sequence encodes a polypeptide which desaturates a fatty acid molecule at 
carbon 5 from the carboxyl end of a fatty acid molecule. 

24. A nucleic acid construct comprising: 

a nucleotide sequence which encodes a functionally active A5- 
S desaturase, said desaturase having an amino acid sequence which corresponds 
to or is complementary to all of or a portion of an amino acid sequence depicted 
in a SEQ ID NO: 2, wherein said nucleotide sequence is operably linked to a 
promoter functional in a host cell. 

25. A recombinant yeast cell comprising: 

10 a nucleic construct according to Claim 23 or Claim 24. 

26. The recombinant yeast cell according to Claim 25, wherein said 
yeast cell is a Saccharomyces cell. 

27. A host cell comprising: 

at least one copy of a nucleotide sequence which encodes a polypeptide 
1 5 which converts dihomo-y-linolenic acid to arachidonic acid, wherein said 

microbial cell or an ancestor of said microbial cell was transformed with a 
vector comprising said nucleotide sequence, and wherein said nucleotide 
sequence is operably linked to a promoter functional in said host cell. 

28. The microbial cell according to Claim 27, wherein said cell is a 
20 host cell selected from the group consisting of a fungal cell and an algal cell. 

29. The microbial cell according to Claim 28, wherein said fungal 
cell is a yeast cell and said algae cell is marine algal cell. 

30. The microbial cell according to Claim 27, wherein said cell is 
enriched for 20:3 fatty acids as compared to a host cell which is devoid of said 

25 nucleotide sequence. 

3 1 . The microbial cell according to Claim 27, wherein said cell is 
enriched for 20:4 or o)-3 20:4 fatty acids as compared to a host cell which is 
devoid of said DNA sequence. 
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32. The microbial cell according to Claim 27, wherein said cell is 
enriched for 20:5 fatty acids as compared to a host cell which is devoid of said 
DNA sequence, 

33. The microbial cell according to Claim 27, wherein said cell has 
5 an altered amount of 20:3 (8, 1 1, 14) fatty acid as compared to an 

untransformed microbial cell. 

34. A method for production of arachidonic acid in a microbial cell 
culture, said method comprising: 

growing a microbial cell culture having a plurality of microbial cells, 
1 0 wherein said microbial cells or ancestors of said microbial cells were 

transformed with a vector comprising one or more nucleic acids having a 
nucleotide sequence which encodes a polypeptide which converts dihomo-y- 
linolenic acid to arachidonic acid, wherein said one or more nucleic acids are 
operably linked to a promoter, under conditions wherein said one or more 
15 nucleic acids are expressed and arachidonic acid is produced in said microbial 

cell culture. 

35. The method of Claim 34, wherein said polypeptide is an enzyme 
which desaturates a fatty acid molecule at carbon 5 from the carboxyl end of 
said fatty acid molecule. 

20 36. The method of Claim 34, wherein said nucleotide sequence is 

derived from a Mortierella species. 

37. The method according to Claim 34, wherein said dihomo-y- 
linolenic acid is exogenously supplied. 

38. The method according to Claim 34, wherein said microbial cells 
25 are yeast cells. 

39. The method according to Claim 38, wherein said yeast cells are 
Saccharomyces species cells. 

40. The method according to Claim 34, wherein said conditions are 
inducible. 



124- 



wo 98/46765 PCT/US98/07422 



41 . A recombinant yeast cell which converts greater than about 5% 
of a 20:3 fatty acid to a 20:4 fatty acid. 

42. A nucleic acid probe comprising: 

a nucleotide sequence as represented by SEQ ID N0:1. 
5 43. A host cell comprising: 

a nucleic acid construct according to Claim 23 or Claim 24. 

44. A host cell comprising: 

a vector which includes a nucleic acid which encodes a fatty acid 
desaturase derived from Mortierella alpina^ wherein said fatty acid desaturase 
10 comprises an amino acid sequence represented by SEQ ID N0:2, wherein said 

nucleic acid is operably linked to a promoter. 

45. The host cell according to Claim 44, wherein said host cell is a 
eukaryotic cell. 

46. The host cell according to Claim 45, wherein said eukaryotic cell 
15 is selected from the group consisting of a mammalian cell, a plant cell, a iungal 

cell, an avian cell and an algal cell. 

47. The host cell according to Claim 45, wherein said host cell 
contains dihomo-gamma-linolenic acid. 

48. The host cell according to Claim 45, wherein said host cell 
20 contains EPA. 

49. The host cell according to Claim 44, wherein said promoter is 
exogenously supplied. 

50. A method for desaturating a dihomo-Y-linolenic acid, said 
method comprising: 

25 culturing a recombinant microbial cell according to Claim 37, under 

conditions suitable for expression of polypeptide encoded by said nucleic acid, 
wherein said host cell further comprises a fatty acid substrate of said 
polypeptide. 
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51. A fatty acid desaiurated by the method according to Claim 50. 

52. An oil comprising a fatty acid according to Claim 5 1 . 

53. A method for obtaining altered long chain polyimsaturated fatty 
acid biosynthesis comprising the steps of: 

growing a microbe having cells which contain a transgene which 
encodes a transgene expression product which desaturates a fatty acid molecule 
at carbon 5 from the carboxyl end of said fatty acid molecule, wherein said 
trangene is operably associated with an expression control sequence, under 
conditions whereby said transgene is expressed, whereby long chain 
polyunsaturated fatty acid biosynthesis in said cells is altered. 

54. A method for obtaining altered long chain polyunsaturated fatty 
acid biosynthesis comprising the steps of: 

growing a microbe having cells which contain a transgene, derived from 
a fungus or algae, which encodes a transgene expression product which 
desaturates a fatty acid molecule at carbon 5 from the carboxyl end of said fatty 
acid molecule, wherein said trangene is operably associated with an expression 
control sequence, imder conditions whereby said one or more transgenes is 
expressed, whereby long chain polyunsaturated fatty acid biosynthesis in said 
cells is altered. 

55. The method according to claims 53 or 54, wherein said long 
chain polyunsaturated fatty acid is selected from the group consisting of ARA, 
DGLA and EPA. 

56. A microbal oil or fraction thereof produced according to the 
method of claims 53 or 54. 

57. A method of treating or preventing malnutrition comprising 
administering said microbal oil of claim 56 to a patient in need of said treatment 
or prevention in an amount sufficient to effect said treatment or prevention. 

58. A pharmaceutical composition comprising said microbal oil or 
fraction of claim 56 and a pharmaceutically acceptable carrier. 
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59. The pharmaceutical composition of claim 58, wherein said 
pharmaceutical composition is in the form of a solid or a liquid. 

60. The pharmaceutical composition of claim 59, wherein said 
pharmaceutical composition is in a capsule or tablet form. 

61 . The pharmaceutical composition of claim 58 further comprising 
at least one nutrient selected from the group consisting of a vitamin^ a mineral, a 
carbohydrate, a sugar, an amino acid, a free fatty acid, a phospholipid, an 
antioxidant, and a phenolic compound. 

62. A nutritional formula comprising said microbal oil or fraction 
thereof of claim 56. 

63. The nutritional formula of claim 62, wherein said nutritional 
formula is selected from the group consisting of an infant formula, a dietary 
supplement, and a dietary substitute. 

64. The nutritional formula of claim 63, wherein said infant formula, 
dietary supplement or dietary supplement is in the form of a liquid or a solid. 

65. An infant formula comprising said microbal oil or fraction 
thereof of claim 56. 

66. The infant formula of claim 65 further comprising at least one 
macronutrient selected from the group consisting of coconut oil, soy oil, canola 
oil, mono- and diglycerides, glucose, edible lactose, electrodialysed whey, 
electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

67. The infant formula of claim 66 further comprising at least one 
vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, seleniimi, and iron. 

68. A dietary supplement comprising said microbal oil or fraction 
thereof of claim 56. 
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69. The dietary supplement of claim 68 further comprising at least 
one macronutrient selected from the group consisting of coconut oil, soy oil, 
canoia oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 

S hydrolysates. 

70. The dietary supplement of claim 69 further comprising at least 
one vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 

1 0 chloride, iodine, selenium, and iron. 

71. The dietary supplement of claim 68 or claim 70, wherein said 
dietary supplement is administered to a human or an animal. 

* 

72. A dietary substitute comprising said microbal oil or fraction 
thereof of claim 56. 

15 73. The dietary substitute of claim 72 fiirther comprising at least one 

macronutrient selected from the group consisting of coconut oil, soy oil, canoia 
oil, mono- and diglycerides, glucose, edible lactose, electrodialysed whey, 
electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

20 74. The dietary substitute of claim 73 fiirther comprising at least one 

vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

25 75. The dietary substitute of claim 72 or claim 74, wherein said 

dietary substitute is administered to a human or animal. 

76. A method of treating a patient having a condition caused by 
insuffient intake or production of polyunsaturated fatty acids comprising 
administering to said patient said dietary substitute of claim 72 or said dietary 
30 supplement of claim 68 in an amount sufficient to effect said treatment. 
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77. The method of claim 72, wherein said dietary substitute or said 
dietary supplement is administered enterally or parenterally. 

78. A cosmetic comprising said microbal oil or fraction thereof of 
claim 56. 

5 79. The cosmetic of claim 78, wherein said cosmetic is applied 

topically. 

80. The pharmaceutical composition of claim 58, wherein said 
pharmaceutical composition is administered to a human or an animal. 

81 . An animal feed comprising said microbal oil or fraction thereof 
10 of claim 56. 

82. The method of claim 54 wherein said fungus is Mortierella 
species, 

83. The method of claim 82 wherein said fungus is Mortierella 

alpina. 

15 84. An isolated nucleotide sequence selected from the group 

consisting of SEQ ID N0:13 and SEQ ID N0:15. 

85. An isolated nucleotide sequence from the group consisting of 
SEQ ID NO:7 and SEQ ID NO: 19. 

86. An isolated nucleotide sequence comprising a nucleotide 

20 sequence selected from the group consisting of: SEQ ID NO: 13; SEQ ID NO: 
15; SEQ ID N0:17; SEQ ID NO:l9; SEQ ID NO:21; SEQ ID NO:22; SEQ ID 
NO:23; SEQ ID NO:24; SEQ ID NO:25; SEQ ID NO:26 and SEQ ID NO:27. 

87. An isolated peptide sequence comprising a peptide sequence 
selected from the group consisting of: SEQ ID NO: 14; SEQ ID NO: 16; SEQID 

25 NO: 1 8; SEQ ID NO:20; SEQ ID NO:28; SEQ ID NO:29; SEQ ID NO:30; SEQ 
ID NO:31; SEQ ID NO:32; SEQ ID NO:33 and SEQ ID NO:34. 

88. Purified polypeptides produced from the nucleotide sequences of 
claims 84-86. 
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FastA Match of ina29 and contig 2S3538a 



SCORES Initl: 117 Initn: 225 Opt: 256 

Smith-Waterman score: 408; 27.0% identity in 441 aa overlap 
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FastA Match of ina524 and contig 253538a 



Initl: 231 Initn: 499 Opt: 401 
Stadth-Wacexnan score: 620; 27.3% identity in 455 aa overlap 
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